Abstract



The variety of surface forms that may be used to convey a given speech act poses a
major problem in modelling task-oriented (and other) dialogues. Many such forms are
so-called indirect speech acts, that is, forms whose surface form does not correspond to
the (or one) intended speech act. While this topic has received extensive attention from
linguists, their concerns have not usually been computationally motivated. In this paper,
I present a non-computational analysis of indirect speech act forms with an eye to
computational considerations.

The paper is divided into two parts. Part 1 presents categories and rules for indirect 
speech acts, justified where possible by traditional linguistic arguments. The second part of 
the paper draws a set of computational implications from the material presented in Part 1. 
This is done within the general framework of a process model of recognition. Part 2 
contains a discussion of the basic types of mechanisms needed for the classes of indirect 
speech act identified in Part 1. The discussion includes an examination of the dependencies 
between processes and an initial categorization of the types of knowledge that must be 
considered in interpreting indirect speech acts. 
1. Introduction



1.1. The Area of Interest



The variety of surface forms that may be used to convey a given speech act poses a
major problem in modelling dialogue. 1 Consider, for example, some of the different ways to
ask someone else to write a computer program for you:



1.1 I request that you write a program to manipulate blocks for me. 

1.2 Write a program to manipulate blocks for me.

1.3 Would/Will you write a program to manipulate blocks for me? 

1.4 Could/Can you write a program to manipulate blocks for me? 

1.5 I want you to write a program to manipulate blocks for me.

1.6 I want a program to manipulate blocks written.

1.7 I want a program to manipulate blocks. 

1.8 I would like you to write a program to manipulate blocks for me. 

1.9 I would like a program to manipulate blocks written. 

1.10 I would like a program to manipulate blocks. 

1.11 Give me a program to manipulate blocks.

1.12 Would/Will you give me a program to manipulate blocks? 

1.13 Could/Can you give me a program to manipulate blocks? 

1.14 I want you to give me a program to manipulate blocks. 

1.15 I would like you to give me a program to manipulate blocks.

1.16 Write me a program, would/will you? 

1.17 Write me a program, could/can you?



1. I am grateful to William A. Martin and members of the Knowledge Based Systems Group
at the M.I.T. Laboratory for Computer Science. Their work provided the foundation on
which this paper is based. The OWL-I knowledge representation formalism used for the
process model discussed in Part 2 and the approach to interpreting procedural
representations are products of the efforts of this group. I also wish to thank Bill Mark,
Candy Sidner, Peter Szolovits, and Jim Weiner for helpful comments on drafts of this paper
and Ashok Malhotra, William Mann, and Louis Midi* for making dialogue transcripts
available to me.
Utterances 1.3 to 1.17 are commonly called indirect speech acts, since the surface form does
not correspond directly to the intended speech act. This is not a complete list but is intended
only to illustrate the variety of forms that are possible. Similar lists could be made for other
speech acts. Requests seem to have the largest number of possible forms, but warnings,
suggestions, and questions also appear in a number of variations. In fact, all speech acts
with the exception of some ceremonial acts seem to have at least some indirect forms.

For a theory to adequately deal with indirect speech acts (henceforth, ISAs 2), it must
contain at least the following components:

1. A characterization of the forms that may be used to convey
individual speech acts

2. A description of mechanism(s) to relate surface forms to
underlying speech acts

3. An account of the conversational implications of choosing
a particular surface form

Parts 1 and 2 of this paper relate, respectively, to the first two of these goals. In Part 1 I
present a group of distinctions intended as the core of a taxonomy of ISA forms (Goal 1).
The categories proposed draw on insights from, but not necessarily the theoretical
framework of, previous work in this area (e.g., Sadock, Searle, and Gordon and Lakoff).
The categories differ in important ways, however, from existing proposals. Although the



2. I hope that the similarity between this abbreviation of indirect speech act and IS-A, a
name used commonly in the Artificial Intelligence literature for a hierarchic semantic
relationship, is not too distracting. The letters of ISA are pronounced individually, not as
an acronym.

3. Here and elsewhere in the paper, I use surface form to mean simply a written or spoken
English utterance.




choice of these categories was strongly motivated by computational considerations, they are
supported by independent linguistic evidence as well. This use of non-computational
evidence to justify computationally attractive categories is a methodological strength of the
presentation.

Part 2 of this paper focusses on the second area cited above. There I concentrate on
recognition and describe aspects of an ISA process model. ISA recognition cannot be
completely divorced from general issues of recognition, and I have set the process model
within a larger framework. ISA processing can, however, be viewed independently of a
particular implementation, and that has been done here.

The third area cited above, accounting for the conversational implications of choices,
will not be given a systematic treatment here due to the scope of the problem. Some work
in this area has already been done, e.g., that by Lakoff [15] and Davison [7].

My account of ISAs will be restricted to task-oriented dialogue, which was defined by
Grosz as "one in which two (or more) people communicate for the sole purpose of
completing some task" [8]. 4 I would extend this definition beyond people to include
communicating systems (i.e., computers). A good model of task-oriented dialogue would
have obvious practical advantages in the realm of human-machine interaction. Beyond
this, task-oriented dialogue has some methodological advantages. First, the task being
carried out not only gives structure to the dialogue but also gives valuable independent
information to the outsider trying to interpret dialogue transcripts. Information about the


4. Although this definition opens up the possibility of more than two participants, I will
restrict my attention to two-party conversations. 
task can be particularly useful in ascribing goals and motivations to dialogue participants.
Second, the conversational implications of choosing a particular ISA form (Goal 3, above)
are generally less crucial to task-oriented dialogue than, say, to the therapeutic dialogue
analyzed by Labov and Fanshel [14]. This does not mean that the implications of choices
can be ignored in an ISA model, but it does mean that work on modelling task-oriented
dialogue can proceed without waiting for a full systematic treatment of the implications of
choices.

In developing this treatment of ISAs, I have concentrated on the areas of similarity
between typewritten and spoken utterances, which means that I will have little to say about
features unique to speech, e.g., intonation. Certainly more work on the place of special
speech mechanisms is warranted. The commonalities between written and oral modes are a
good place to start, however, because there is a good deal of lexical and grammatical
variation to be accounted for in ISAs.

Two additional remarks: first, examples are drawn from several different sources. A
number of examples are drawn from the ISA literature, and some have been taken from 
transcripts of both typewritten and spoken dialogues. The rest of the examples are 
manufactured ones, to avoid the necessity of long scene-setting explanations and to more 
easily highlight the phenomena that are of interest. Although I have been able to find 
counterparts for some of the manufactured examples in dialogue transcripts, this has not 
been the case for all of them. In general, I leave the empirical justification of the proposed
ISA framework to further research and appeal instead to the linguistic intuitions of the 
reader. 
The second remark relates to representation. A major focus of this paper is on rules
and pieces of semantic representation that together, I claim, can be shown to account for a
large variety of ISA forms. The rules and semantic representations are presented here in
informal English, although they have been translated into a knowledge representation
scheme called OWL-I. I feel that semantic representations in general are still in their
infancy; we still have only an imperfect idea about what elements should be included in a
semantic representation and why. Because of this, little would be gained by the use of the
notation here. The loss of precision in using English is hopefully offset by the greater
intelligibility of the presentation and the obvious applicability of the ISA rules across
representation schemes.

I am, then, considering ISAs in task-oriented dialogue, and I will be working
primarily with the commonalities that exist between written and spoken forms. Before a
more detailed survey of what the paper will include, I will list some of the areas that will
not be considered.

1.2. Excluded Topics 

There are two sets of topics that will be excluded from consideration. The first relates 
to phenomena that I do not consider to be ISAs. The second set of topics relates to
legitimate ISAs that will be ignored in an effort to limit the scope of the paper. 

We start with the non-ISAs. ISAs will be categorized by their intended underlying
speech acts, so this excludes speech acts that are not intended by the speaker/sender, P1. 5



5. To simplify discussion of dialogue exchanges, I will call the speaker/sender of the first
utterance considered P1 and the hearer/receiver P2. If subsequent related utterances are




For example, consider the case where P1 intends a simple description of a personal wish but,
on hearing P1's utterance, P2 treats it as a request and sets about to make the wish a reality.
Whatever the result of P1's utterance, if P1 did not intend the utterance as a request, then I
will not consider it one.

Another sort of utterance that will not be considered to be an ISA is one where there
is an additional implicit sense arising only from a particular co-occurrence of dialogue steps.
For example, in a computer console session environment, users sometimes type "Thank you"
in a place where other users type "Thank you" followed by "Good-bye." I do not consider
"Thank you", when it occurs alone in such a situation, to be an indirect closing. Instead, I
consider the closing to be an optional step, which may be omitted in the presence of
utterances that uniquely identify the place in the dialogue. A similar optional step would be
a statement acknowledgment such as "I understand," "Uh huh" or "O.K." If the next
utterance is one that shows that the statement has been understood, an acknowledgment is
not strictly necessary. The criterion I am using to distinguish ISAs from utterances
preceded or followed by omitted optional steps is that ISA forms are derivable from
conditions associated with the conveyed speech act(s). Utterances that "imply" omitted steps do
so based on relationships at a level of dialogue structure more aggregate than speech acts. 6
Consequently, just because one utterance U1 implies another, U2, this does not mean that U1



discussed, then P1 and P2 continue to refer to the same participants. The only places where I
depart from this convention are in reporting on other theories. In these cases, I follow the
terminology used by the author of the theory.

6. See Section 4 for a discussion of aspects of this level of dialogue structure. I suspect that
application of the framework described there could lead to good progress in characterizing 
the conditions that permit omitted steps. 




should automatically be classed as an ISA with the force conveyed by U2 as an underlying
speech act.

This takes care of forms that will not be considered ISAs, and now I briefly list
phenomena that, although interesting, are beyond the scope of this paper. The first of the
excluded phenomena are cases in which rules or conventions are violated. Examples are
deliberate violations such as insincerity and accidental ones such as mistakes. In addition, I
will not discuss what I consider to be second order uses of speech acts, such as sarcasm,
jokes, or failure to make standard choices (e.g., P1 makes an utterance and has not decided
whether it is a question or a request for a nonverbal action). 7 Finally, I have excluded one
particular class of ISAs, those identified by Fraser as hedged performatives [9]. This class
of ISAs, although extremely interesting, seems to be of relatively minor importance for the
type of dialogue I am considering.

A final area that is considered only partially is context. Linguistic and situational
context plays a major role in both the generation and the understanding of ISA forms. The
role of context clearly cannot be ignored. At the same time, a full account of the nature of
context is a massive undertaking, and one that has implications far beyond ISA forms.
This paper, then, contains less than the full story on context, although I have given it a
relatively large amount of attention. See in particular Section K.



7. One interesting second order use of ISAs is in what can be called force shift. In these
cases, one speech act form is used to "masquerade" as another. For example, one may use a
suggestion form such as "How about picking up the blocks now?" in an environment where
authority and role relationships make it clear that the utterance is functioning as a
command. In general, force shift seems to be used to give P1 the appearance of greater
benevolence or to save face for P2.
1.3. The Organization of the Paper

I will finish this introduction with a brief description of the organization of the
paper. Part 1 begins with a description of the major points of speech act theory that have
affected the model presented here. In Section 2 I am particularly concerned with issues of
taxonomy, since an account of ISAs depends on clear boundaries between classes of speech
acts. Section 3 gives a detailed view of the speech acts used as examples throughout the
paper, and Section 4 introduces the OWL-I method, a structure for representing actions.
The preliminaries aside, Sections 5 through 7 focus on ISAs. Section 5 discusses three
different theories of ISAs, and the next two sections present a theory that builds on all three
of these, while also accounting for additional phenomena. In particular, Section 6 presents a
set of general ISA rules. Section 7 shows how these rules must be restricted and augmented
to account for the ISA data. This section identifies a set of categories that account for some
of the major differences found among ISA forms.

The computational implications of the ISA taxonomy are developed in Part 2. The 
central thesis of Part 2 is as follows: 



The phenomenon of indirect speech acts is too complex to admit to a single,
uniform computational treatment. It is necessary, then, to identify classes of indirect
speech acts that share similar computational properties and use different
representations and processing strategies for the different classes.



By no coincidence at all, the candidates for differing computational treatment are exactly the
categories developed in Part 1.

While recognition of ISAs has its own unique aspects, it cannot be viewed entirely
separately from the whole problem of recognizing utterances. For this reason, after some
additional introductory remarks in Section 8, Section 9 discusses the points in the
recognition process that have the most direct relevance to the processing of ISAs. Section 10
presents general observations on ISA processing in the light of the recognition framework
that was described, and Section 11 contains specific proposals for a mechanism. Finally,
Section 12 is an initial characterization of the knowledge sources that come into play in
framing and interpreting ISAs.
PART 1
THE TAXONOMY OF INDIRECT SPEECH ACTS PROPOSED 
2. Defining and Organizing Speech Acts

2.1. Some Major Points from Speech Act Theory 

Any treatment of ISAs necessarily rests on a general theory of speech acts.
Fortunately for the theory, but unfortunately for would-be summarizers, the speech act
literature is extensive. Since I cannot hope to adequately summarize it here, I will mention
only those aspects of the theory that are reflected directly in the account of ISAs to be given.
Readers unfamiliar with speech act theory are referred to Austin, Searle [29], and, for a
wide-ranging bibliography, Verschueren [34]. More specialized references will be cited in
the course of this section.

The central thesis of speech act theory is that a strict dichotomy between speech and 
action is untenable. Before Austin, much of the philosophical interest in language centered 
around its descriptive uses and the conditions under which statements could be judged right 
or wrong. While acknowledging this constative use of language, Austin identified a class of 
utterances which he called performatives. Standard examples of explicit performative
utterances are: 

2.1 I name this ship the Queen Elizabeth.

2.2 I bet you sixpence it will rain tomorrow. 

Once 2.1 is uttered under the proper circumstances, the ship officially has a name, and, once
2.2 is uttered, a bet has been proposed. The important point about these and other
performatives is that saying something is also doing something. 

Austin's central insight that language is action was taken up and developed further in
the work of Searle. In [29] Searle identifies five types of constitutive rules for speech acts.
Constitutive rules create or define new forms of behavior, as opposed to regulating or
sanctioning behavior. Searle's five types of rules, when taken together, are intended to give
necessary and sufficient conditions for performance of a speech act. The five types of rules
are:



1. Rules shared by all speech acts are those based on normal input and output
conditions, which Searle describes as "the range of conditions under which any kind
of serious and literal linguistic communication is possible." 8



2. Propositional content rules describe constraints on the propositional content of
speech acts. The propositional content of a speech act is, very informally, what the
speech act is "about." For example, the propositional content of a request is "a future
act A of H." (H is Searle's abbreviation for hearer; S, which appears below, stands for
speaker.)



3. Sincerity rules are those based on conditions that must occur for the speech act to
be performed sincerely. The sincerity rule for request is that the request is uttered
only if "S wants H to do A."



4. Preparatory rules are those based on conditions that must obtain initially for the
speech act to be performed successfully. An example for request is that the request is
uttered only if "H is able to do A."



5. Essential rules are based on what Searle sees as the essential feature of the speech
act, one that distinguishes it from other similar speech acts. For requests, the essential
rule is that the request "counts as an attempt to get H to do A."
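
The request conditions quoted in the list above can be collected into a single record, which may help when comparing speech acts rule by rule. The following is only a minimal sketch: the class name, the field names, and the helper function are hypothetical, the conditions are kept as informal English strings rather than in any real representation scheme, and rule type 1 (normal input and output conditions) is left out because it is shared by all speech acts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechActRules:
    """Rule types 2-5 for one speech act, kept as informal English strings."""
    name: str
    propositional_content: str
    sincerity: str
    preparatory: str
    essential: str


# The request entry uses the wording quoted from Searle in the list above.
REQUEST = SpeechActRules(
    name="request",
    propositional_content="a future act A of H",
    sincerity="S wants H to do A",
    preparatory="H is able to do A",
    essential="counts as an attempt to get H to do A",
)


def unmet_conditions(rules, facts):
    """List the sincerity and preparatory conditions missing from `facts`.

    `facts` is a hypothetical set of strings standing in for what is
    known about the situation; matching is by exact string equality.
    """
    return [c for c in (rules.sincerity, rules.preparatory) if c not in facts]
```

Given only the fact "S wants H to do A", the helper reports the preparatory condition as unmet. Exact string matching is a deliberate simplification; a real model would need structured conditions.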



Throughout the work on speech acts, there is a basic tension between underlying
speech acts and the words used to carry out or describe them, commonly called speech act

8. [29], p. 57.
verbs (henceforth SAVs). Verbs such as ask, request, congratulate, promise, apologize, etc.
can be used to carry out the action conveyed, while other verbs such as convince, persuade,
and intimidate are used descriptively. 9 Austin offers a preliminary taxonomy not of speech
acts but of SAVs. Searle, who presents a taxonomy of speech acts (see [31]), points out that
different SAVs do not necessarily convey different speech acts; some SAVs, for example,
differ only in the manner of carrying out the speech act. Searle's example, which is
somewhat controversial, is announce, since one can announce orders, promises, and reports.
Another example might be the use of remark and emphasize to reflect differences in delivery
of, potentially, the same information.

What are the implications of these ideas for a computational theory? First, the view
that speech is action means that, as a minimum, speech acts should be represented in a
way that reflects the commonalities with other sorts of actions. At the same time, the
representation must be able to reflect the special properties of speech (and written
communication) that distinguish speech acts from other sorts of actions. Finally, we want to
maintain a distinction between speech acts and SAVs as a source of information about the
boundaries of speech acts. All of these themes will play an important role in the treatment
of ISA forms presented here.



9. This latter group of verbs focusses not on the speech act (the illocutionary act), but on the
effects of the speech act on the hearer. Austin calls the acts conveyed by such verbs
perlocutionary acts. It seems to me to be desirable, however, to emphasize the commonality
of the speech act between, say, argue and convince. These refer to the same type of act, but
for the latter, the effect is described as well. See [6] for a similar treatment of perlocutionary
acts within a computational framework.
2.2. Half of an Approach to Taxonomy

Austin estimated the number of SAVs to be between one and ten thousand.
Assuming that the number of speech acts is also of this order, then clearly a taxonomy is
essential to a general theory of speech acts. It turns out that a taxonomy, or at least a
principled approach to one, is also crucial to a treatment of ISA forms. If one wishes to
claim that forms associated with one speech act are used to convey another, then one must
be quite sure that the two speech acts are really distinct. Otherwise, the "indirection" is no
indirection at all.

In this subsection I present an approach to taxonomy that combines aspects of earlier
work with some new additions. Before plunging in, I wish to emphasize that this is not
seen as the only speech act organization possible, nor is it seen as the only computationally
useful one. It is, however, one that plays a significant role in ISA organization. Section 3
presents additional speech act categories that will be used in the ISA analysis.

For the basic approach to taxonomy, I take a high level set of categories from Searle
but use a defining criterion derived from work by Verschueren. At the most general level
of speech act categories, I start with the five classes identified by Searle [31]: representatives,
directives, commissives, expressives, and declarations. Searle bases these five classes on the
"illocutionary point" of the act, that is, on "differences in the point (or purpose) of the (type
of) act." Because of the importance of these speech act classes, I will give Searle's definition
of each, excerpted from [31]:
Representatives (e.g., asserting): The point or purpose of the members of the
representative class is to commit the speaker (in varying degrees) to something's being
the case, to the truth of the expressed proposition.



Directives (e.g., requesting): The illocutionary point of these consists in the fact that
they are attempts (of varying degrees, and hence, more precisely, they are determinates
of the determinable which includes attempting) by the speaker to get the hearer to do
something.



Commissives (e.g., promising): Commissives...are those illocutionary acts whose point is
to commit the speaker (again in varying degrees) to some future course of action.



Expressives (e.g., thanking): The illocutionary point of this class is to express the
psychological state specified in the sincerity condition about a state of affairs specified
in the propositional content.



Declarations (e.g., nominating a candidate): It is the defining characteristic of this class
that the successful performance of one of its members brings about the
correspondence between the propositional content and reality...



These five classes give an excellent start toward a computationally useful taxonomy of ISAs.
The notion of illocutionary point, however, has been criticized by Verschueren [35] for its
combination of illocutionary (i.e., act-related) and perlocutionary (i.e., effect-related)
properties. (The definition of directives mentions the hearer explicitly, but other categories
do not.) Verschueren views SAVs as reflecting the speaker's intention to bring about some
effect in the hearer by pronouncing an utterance.

Verschueren's focus on the hearer can be profitably applied to speech acts as well as
SAVs; the one addition that I would make is that some speech acts can have more than one
intended effect on the hearer, so that the classification criterion must be according to the
principal effect that the speaker (P1) intends to have on the hearer (P2). Using this
approach, Searle's five categories can be redefined as follows: 10



Representatives have as their principal intended effect that P2 come to know that P1
believes something. 11



Directives have as their principal intended effect that P2 take responsibility for
carrying out some action. 12



Commissives have as their principal intended effect that P2 accept P1's commitment
to take responsibility for carrying out some action.



Expressives have as their principal intended effect that P2 come to know that P1 feels
something.



Declarations have as their principal intended effect that P2, and possibly society as 
well, come to perceive and accept a change in reality. 
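
As an illustration only, the five redefined classes above can be written down as a small lookup keyed by class. This is a sketch under stated assumptions: the enum name, the table of example acts, and the function name are hypothetical, and the example acts are simply those that accompany Searle's definitions (asserting, requesting, promising, thanking, nominating).

```python
from enum import Enum


class SpeechActClass(Enum):
    """Top-level classes, each valued by its principal intended effect."""
    REPRESENTATIVE = "P2 comes to know that P1 believes something"
    DIRECTIVE = "P2 takes responsibility for carrying out some action"
    COMMISSIVE = ("P2 accepts P1's commitment to take responsibility "
                  "for carrying out some action")
    EXPRESSIVE = "P2 comes to know that P1 feels something"
    DECLARATION = ("P2, and possibly society, comes to perceive and "
                   "accept a change in reality")


# Hypothetical table: one example act per class, taken from the examples
# that accompany Searle's definitions.
EXAMPLE_ACTS = {
    "assert": SpeechActClass.REPRESENTATIVE,
    "request": SpeechActClass.DIRECTIVE,
    "promise": SpeechActClass.COMMISSIVE,
    "thank": SpeechActClass.EXPRESSIVE,
    "nominate": SpeechActClass.DECLARATION,
}


def principal_intended_effect(act):
    """Look up the principal intended effect of a known example act."""
    return EXAMPLE_ACTS[act].value
```

Keying the classification on the effect intended for P2 keeps both participants in view, which is the property the criterion is chosen for.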



We have, then, Searle's speech act categories and a defining criterion based on a
suggestion by Verschueren. The criterion is suited to a computational approach because it
points in the direction of larger patterns of action. It takes into account both participants in
a dialogue and foreshadows responses to the speech act (i.e., whether the speaker's intention


10. The first three criteria are closely patterned after similar ones given by Verschueren in

11. I use "come to know" rather than "know" here to indicate that I am defining the principal
intended effect as an action rather than simply as a state. Expressives, commissives, and
declarations are given the same treatment.

12. I use take responsibility here to indicate that P2 may either carry out the action or
subcontract some or all of it.
is satisfied, is not satisfied, or whether the response falls somewhere in between). I will come
back to these issues in Section 4. 

This takes care of the top of the speech act taxonomy. Besides its use in forming the
most general categories, the "principal intended effect" criterion can also be used to form 
intermediate levels of speech act classes. Although I have found such categories useful for 
other purposes, they are not necessary for the ISA analysis. I merely mention their 
possibility and go on in the next subsection to the other half of the taxonomy, the question 
of how to distinguish speech acts from each other. 

2.3. The Other Half 

This brings us to the level of speech acts. 13 Despite all the work that has been done
in speech act theory, I believe that we still lack a totally satisfying answer to the question of
how one speech act is to be distinguished from another. Searle's constitutive rules give a
way to distinguish speech acts from other sorts of actions and give a framework for
expressing distinctions between speech acts, but they do not specify exactly where lines are to
be drawn between different speech acts. The "principal intended effect" criterion gives a
necessary condition for differentiating between speech acts, but it does not seem to be
sufficient for the purposes of studying ISAs. For example, warn and command are generally
considered to be different speech acts, but both can be viewed as attempts by P1 to get P2 to
take responsibility for carrying out some action. Granted, warnings as a class have the



13. I will refer to generic speech acts as, simply, speech acts. Realizations of a generic 
speech act will be referred to as particular speech acts. 
extra restriction that the action is to avoid some condition that is perceived by P1 to be
against P2's best interests. There is nothing to stop one, however, from commanding the
same sort of action. Thus, an utterance such as 2.3 might be construed as a warning or a
command, depending on whether authority or "good sense" is being appealed to.

2.3 Keep off the ice.

In such borderline cases, the difference seems to rest on the nature of P2's obligation to take
responsibility for the action. Warnings seem to be empowered by an appeal to P2 to act
reasonably, i.e., in P2's own interest. Commands are empowered by the authority
relationship that exists between P1 and P2. If command and warn must be distinguished
based on the nature of the relationship appealed to, then the "principal intended effect"
criterion is clearly not sufficient. I need, then, a principled approach for the speech act level
of the taxonomy.
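
The warn/command case can be made concrete with a small sketch. Everything named here is hypothetical (the record type, its field names, and the two comparison functions); the point is only that two acts may share a principal intended effect and still differ in the relationship that empowers them.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Directive:
    """A directive described by its effect and by what gives it force."""
    name: str
    principal_effect: str
    empowered_by: str


EFFECT = "P2 takes responsibility for carrying out some action"

WARN = Directive("warn", EFFECT, "appeal to P2's own interest")
COMMAND = Directive("command", EFFECT, "authority of P1 over P2")


def differ_by_effect(a, b):
    """True if the principal intended effect alone separates the two acts."""
    return a.principal_effect != b.principal_effect


def differ_by_effect_or_basis(a, b):
    """True if either the effect or the empowering relationship differs."""
    return (a.principal_effect, a.empowered_by) != (
        b.principal_effect, b.empowered_by)
```

Under these assumptions the first comparison fails to separate warn from command and the second succeeds, which is the sense in which the effect criterion is necessary but not sufficient.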

I start the search for a criterion with the conviction that the resulting set of speech
acts should look familiar. They should not be radically different from the categories found
in the literature and, more generally, in English and other languages. The
resulting system should allow the use of familiar terms such as warn, command, request, ask,
tell, etc.

After classifying a number of speech acts, I have found no one simple criterion that
yields the familiar set of speech acts. There are, however, three guidelines for what
constitutes a speech act that I have found useful:
1. SAVs suggest a minimal group of speech acts, if one omits:

a. general SAVs such as say, which can refer to more than one speech act (where
"speech act" is delineated by application of some other guideline).

b. context-marked SAVs such as reply, interject, and answer, which mark the place
of the utterance in the discourse. 14

c. aggregate SAVs such as describe which refer to more than one speech act acting
as a unit.

d. SAVs that differ only according to P1's view of the importance of the speech act,
e.g., mention used for stating.



2. Sets of linguistic realizations that are substitutable for each other without seriously 
disturbing the flow of a dialogue should have a separate speech act with which they 
can be associated. 



3. Since particular speech acts occur as part of larger linguistic and non-linguistic 
activities, speech acts should be chosen so that they are easily integrated into these 
larger patterns of action. 



No matter how refined these guidelines become, they probably cannot be made to 
function independently. That is, no one of them can be turned into a criterion that is 
sufficient to clearly discriminate between all the speech acts that one would expect. With 
respect to SAVs, at least in English there are candidates for the status of speech act that 
have no corresponding SAV. For example, there is a separate set of surface forms for 
giving instructions, and so, according to guideline 2, one would want to identify a separate 
speech act. (There seems to be no English SAV that expresses the notion of giving one
utterance-worth of instructions; instruct is either a request or an aggregate.) SAVs, then,
provide only a minimal guideline.

14. In barring the referents of context-marked SAVs from independent speech act status I
differ with Searle [31]. The difference is not particularly serious and means only that the
class of independent speech acts that I am admitting is slightly smaller than his.

The difficulty with Guideline 2 comes in narrowing down the notion of "seriously
disturbing the flow of dialogue," since probably no utterance forms are ever completely
synonymous. The choice of one form over another seems always to involve some
implications, e.g., politeness, formality, combativeness, etc. There seems to be a grey area
between characteristics of utterances that should be accounted for by "implications of choice"
and those that should be accounted for by assuming a difference in conveyed speech act.

With respect to the third guideline, "ease of integration" is almost certainly not a 
simple property. The process of representing larger patterns of action is open to influences 
beyond those local to the speech act. Frequently, altogether different sets of speech acts can
be supported if different choices are made at other places in the representation. (For
further discussion of larger patterns of action, see Section 4.) 

Even though these guidelines cannot function independently and in some cases they 
will still leave some grey areas, they can still have important implications for a 
computational theory. A set of speech acts that follows the guidelines will have the 
following properties: 



1. The speech acts will be easily related to English SAVs. 

2. The speech act representations will provide a useful place to associate information
about groups of linguistic realizations. 

3. The speech acts will fit into larger patterns of action, allowing us to model not only 
isolated speech acts but also speech acts as they occur in dialogue. 



The speech acts used in rules or examples in this paper are introduced in Section 3. They 
represent an application of the three guidelines given here. After working with them for some
time, I believe that they display the three advantages listed above. These advantages begin
to be apparent in the discussion of the computational model in Part 2. 

2.4. The Notion of a Precondition 

In defining speech acts with ISAs in mind, it is useful to shift the emphasis slightly in
the treatment of constitutive rules. I will therefore introduce the notion of a precondition of
a speech act. Preconditions are a combination of Searle's sincerity conditions and
preparatory conditions viewed from the point of view of P1 (where P1 is the agent of the
speech act). 15 Because P1's point of view may not correspond to reality, particular speech acts
may meet all preconditions and still be defective in the sense used in Searle [29]. The
preconditions that I am interested in are those that differ from one speech act to another.
For this reason, I will ignore general requirements such as the requirement that a given
speech act must be intended by P1.

Although the actual representation that I am assuming for the preconditions is OWL-I,
for the purposes of this paper I present the preconditions in informal English. Despite
the informality, some words are used in a restricted sense and several simple conventions
have been followed. I will discuss these conventions and some of the restrictions here, and
the rest of the restrictions will be explained as examples are presented.



15. Searle's essential conditions are not directly reflected in preconditions, although they seem
to be derivable from them. I see essential conditions as an amalgam of several different
types of information, some of which would be explicit in preconditions and the rest of which
would probably be implicit.

The first convention is that P1 in preconditions refers to the speaker/writer of the
speech act and P2 refers to the hearer/reader. Speech act conditions centered on intentions
of P1 (in the philosophical sense of intention) will have preconditions that correspond
directly, i.e.,

P1 <intention> <object of intention>

e.g., P1 wants some action done.

All other speech act conditions are recast with respect to P1's model of the world, i.e.,

P1 believes <speech act condition>

e.g., P1 believes that P2 can take responsibility for a (particular) action.
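
The two precondition forms just described can be written down as a small data structure. The following is only a sketch in Python, which is not the OWL-I representation assumed in this paper; the class names `Intention` and `Belief` and the `render` method are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Intention:
    """Form 'P1 <intention> <object of intention>' (illustrative name)."""
    verb: str    # the intention verb, e.g. "wants"
    obj: str     # the object of the intention

    def render(self) -> str:
        return f"P1 {self.verb} {self.obj}"


@dataclass(frozen=True)
class Belief:
    """Form 'P1 believes <speech act condition>' (illustrative name)."""
    condition: str   # the condition, recast relative to P1's world model

    def render(self) -> str:
        return f"P1 believes that {self.condition}"


# A precondition is one of the two forms.
Precondition = Union[Intention, Belief]

examples = [
    Intention("wants", "some action done"),
    Belief("P2 can take responsibility for a (particular) action"),
]
for pre in examples:
    print(pre.render())
```

The only design choice here is that every condition not centered on an intention of P1 is wrapped in a belief, which mirrors the recasting of conditions with respect to P1's model of the world.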

This takes care of the conventions; two restrictions are also of interest, want and
believe. For both types of precondition, I have represented goals of participants by the verb
want, i.e., "P1 wants <state or action>" or "P1 believes that P2 wants <state or action>".
Believe is more complicated. As used in preconditions it is broader than know; that is, the
direct object of believe may be either a fact or an opinion. By "P1 believes X" I mean that
P1 has some world model that contains X and P1 believes that X corresponds to an aspect
of reality. By "corresponds to an aspect of reality" I mean that actions based on X reach the
expected results or, if they do not, the deviation from the expectation is not caused by X.
As the philosophical literature attests, much more remains to be said about facts, opinions,
and reality. To avoid infinite -- or at least lengthy -- regress, I will cut off the discussion
here and appeal to the reader's intuitions.

The notion of a precondition is operationally useful in a computational theory. The
preconditions of a speech act are groups of conditions that are expected to obtain before the
speech act is initiated. As such, they should be intuitively obvious to speakers of the
language. One pair of linguistic tests for "preconditionhood" relates to the way that P2 may
claim that a failure has occurred in P1's speech act. For speech acts centered on intentions
of P1, P2 may reply with form 2.4. For others, form 2.5 is possible.

2.4 You don't really <intention>.

e.g., You don't really want me to do that.

2.5 What makes you think <direct object of initial "P1 believes">?

e.g., What makes you think I can tell you?

Note that these tests are necessary but not sufficient to distinguish the preconditions 
particular to a speech act. They also apply to conditions associated with speech acts in 
general. (See Section 6.3 for some of these general conditions.) 
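
As a rough illustration of the two tests, the sketch below (in Python) produces the challenge that P2 might utter for a given precondition. It assumes, as a simplification, that the content of the precondition has already been rephrased from P2's point of view; the names `Precondition`, `kind`, `p2_phrasing`, and `challenge` are hypothetical.

```python
from typing import NamedTuple


class Precondition(NamedTuple):
    kind: str          # "intention" or "belief"
    p2_phrasing: str   # content already phrased from P2's point of view (assumption)


def challenge(pre: Precondition) -> str:
    """Return the form P2 could use to claim that the precondition fails."""
    if pre.kind == "intention":
        # Form 2.4: You don't really <intention>.
        return f"You don't really {pre.p2_phrasing}."
    if pre.kind == "belief":
        # Form 2.5: What makes you think <direct object of "P1 believes">?
        return f"What makes you think {pre.p2_phrasing}?"
    raise ValueError(f"unknown precondition kind: {pre.kind}")


print(challenge(Precondition("intention", "want me to do that")))
print(challenge(Precondition("belief", "I can tell you")))
```

The sketch only generates the surface forms. It does not decide whether a condition is particular to one speech act, since the tests are necessary but not sufficient for that.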

Several examples of the sets of preconditions associated with different speech acts are 
found in the next section, and the usefulness of the concept of a precondition for ISA 
analysis will become apparent in Section 6. 



3. The Speech Acts Used in the Examples

In this section we take a closer look at four commonly occurring speech acts. These 
will be used as examples throughout the paper. The examples are drawn from two of 
Searle's categories that are particularly relevant for task-oriented dialogue: directives and 
commissives. (Representatives, also very important in task-oriented dialogue, are treated as a 
class for reasons given below.) 

The four speech act examples are presented here in terms of two additional categories, 
which will be useful later on in the presentation of the ISA categories. (Recall the warning 
that Searle's five categories were seen as one of several useful divisions of speech acts.) The 
split that I am making is on types of propositional content. The two speech act classes will
be called information-centered and action-centered speech acts; they are defined in the two
subsections below. Note that the two categories do not cover all the speech acts in Searle's
five categories. While it would be possible to extend the alternative categorization scheme, 
this is not necessary for the purposes of this paper. 

3.1. Information-Centered Speech Acts 

Information-centered speech acts are those whose intended effect is the transfer of
information between P1 and P2 with no additional participants involved.
Information-centered speech acts include Searle's representatives and other types of speech
acts such as the directive ASK. 16 For the purposes of this paper, I will need only ASK. While
representatives will play an important role in many aspects of the ISA analysis, I will not be
focussing on ISAs of individual representatives. The process of drawing distinctions
between individual representatives raises many interesting and important philosophical
issues, which are, unfortunately, beyond the scope of this paper.

16. Here and elsewhere, I will capitalize the names of speech acts that are used to refer to
the speech act definitions given in this section. Small letters will be used for references to
other definitions.

The propositional content and the preconditions of ASK are stated informally in
Figure 3.1. We have said that ASK is a directive, i.e., an attempt by P1 to get P2 to take
responsibility for carrying out some action. In this case, the action desired is that P2 give
P1 some information, the answer to a question. 17 As I have defined ASK, it requires that P1
want the answer (precondition I). To be consistent, P1 must also believe that he or she does
not know the answer; this is part of the definition of want. The first precondition, then,
means that certain types of questions are not classed under ASK. Among these are test
questions and ceremonial questions where P1 already knows the answer.

The preconditions in Figure 3.1 should be self-explanatory, with the possible exception
of (V). The notion of obligation used here is a more specific version of the generalized
obligation that Labov and Fanshel use for requests in [14]. (I am using the same set of
obligations for REQUEST as for ASK; further discussion of the relationship between these
two speech acts is given below.) An example of an obligation arising from complementary
roles would be the obligation that applies in an interviewer/interviewee relationship.
Authority obligations are slightly more difficult to identify, since, especially in contemporary
American society, most authority arises from roles. Still, it is possible to identify authority
obligations that, while they may be carry-overs from roles adopted by the participants, do
not arise from current complementary role relationships. For example, if a teacher
encounters a student in the halls of the school and the teacher asks a question, the student is
obliged to answer by an authority obligation. Certainly, the teacher's authority can be seen
as arising from the role, but, in this context, the student need not necessarily be a student of
this teacher for the obligation to apply. Thus, the authority obligation stems from each
participant playing some role, but the role relationship is not complementary in the same
way that it is for teachers and students in a classroom situation.

17. I appeal to intuitive notions of question and answer here, although much more can --
and eventually should -- be said. It appears that a specification of the semantic structure of
questions and a full treatment of the semantics of answers will be sizable tasks.

Figure 3.1. The preconditions for ASK

propositional content: some question

I. P1 wants to know an answer to the question.

II. P1 believes that P2 can tell an answer to the question.

III. P1 believes that P2 is willing to tell an answer to the question.

IV. P1 wants P2 to tell P1 an answer to the question.

V. P1 believes that P2 has some obligation (a role obligation, authority obligation,
or general obligation to be helpful) to P1 to tell P1 an answer to the question.

Comments

(I) Want here and in (IV) implies that P1 does not already know the answer. The case
where P1 does know and merely wants to know if P2 knows (and where P2 knows that P1
knows) is classed as a different speech act.

(I) Know is considered to be a restricted form of believe; while anything can be believed,
only facts can be known. (For ASK, the "fact" is that some proposition is the answer to the
question.) Fact, of course, needs its own definition, which I do not have space to go into
here; instead, I appeal to intuition and observe that facts are most strongly contrasted with
opinions.

(II)-(V) Tell is used here to mean "utter a representative."

(V) More than one of these different types of obligations may apply at once.

Winding up the discussion of obligations, we come to the "general obligation to be
helpful." Being helpful here means that one plays a part in furthering another individual's
goals. Examples of appeals to this obligation would be an ASK to a stranger for the time
or an ASK to a friend for the football scores. This type of obligation seems to arise simply
from the inequality of knowledge between the participants; if P2 knows something that P1
wants to know, then P2 is obligated to be helpful and share the knowledge. This
obligation is not absolute (nor are role or authority obligations), since it may be overridden
by other obligations.

To finish up the discussion of ASK, note that ASK shows strong similarities to
REQUEST, which is discussed in the next subsection. REQUEST is used primarily for
nonverbal actions. The distinction between ASK and REQUEST is based on the second
and third guidelines from Section 2.3. 18 With respect to Guideline 2, ASK has the
interrogative as one of its direct forms (what I am calling its simple form; see Section 7.2).
REQUEST has only the imperative as a comparable form. With respect to Guideline 3, we
see differences in the patterns of action in which ASK and REQUEST take part. One
example is the following pair of responses:

3.1 I can't.

3.2 I can't tell you.

Here, 3.1 can be a response to a direct (i.e., imperative) REQUEST, but it cannot be used
synonymously with 3.2 as a response to a direct (i.e., interrogative) ASK.

All this is not meant to deny that ASK and REQUEST are closely related.
Comparison of Figure 3.4, below, with Figure 3.1 shows strong similarities in the
preconditions of the two speech acts. These strong similarities do not, however, necessarily
justify viewing ASK as a kind of REQUEST. Instead, the similarities can be seen as a
result of the common membership of ASK and REQUEST in the directive class.

18. The first guideline seems fairly inconclusive in this case. True, the SAV ask can be used
in the following two patterns: "asking someone something" and "asking someone to do
something." These two do not, however, correspond completely to a distinction between
describing an ASK and describing a REQUEST. For example, "I asked Harry if he would
lend me a pencil" may be used to report a REQUEST, and "I asked Harry to tell me the
answer" may be used to report an ASK.

3.2. Action-Centered Speech Acts

An action-centered speech act is defined as a speech act that has as its propositional
content an action A1 of a type to be specified, where P1 wants A1 to occur. Here, A1 is either
non-verbal or a verbal action directed at someone other than P1. 19

Action-centered speech acts, then, are acts that attempt to generate an action, not
merely refer to one (as in, for example, thanking someone for some action). Note that the
definition above does not specify that the intended effect be the principal one. The
implication of this is that action-centered speech acts are not restricted to the directives.
The action-centered speech act class also includes members of the commissive and
declaration classes where one of the intended effects is to cause the type of action specified
above. The action-centered speech act class, then, is formed using "intended effect" as a
criterion and cuts across Searle's five major speech act classes.

The action-centered speech acts used as examples in this paper are REQUEST,
SUGGEST, and OFFER. I will summarize the major differences between them first and
then give the preconditions of each. The first difference is the nature of the action expected
of P2. For OFFER, the expected action is that of accepting another action. For
REQUEST, P2 is expected to take responsibility for carrying out an action desired by P1,
and, finally, for SUGGEST P2 is expected to consider a plan of action. A second difference
between the three speech acts lies in the obligations invoked. REQUEST depends on
appeals to one or more of the three types of obligation discussed for ASK, while OFFER
and SUGGEST depend on a combination of self-interest obligations and obligations to "be
helpful." The choice of obligation for OFFER and SUGGEST is discussed further below.

19. Besides the normal "active" actions I include as actions the maintenance of a state, e.g.,
being noisy, and the avoidance of an action, e.g., being quiet. The definition of
action-centered speech acts is meant to exclude verbal actions such as P1 asking P2 for
information but to include P1 asking P2 to give information to a third party, P3. The first
case is already covered by ASK, as defined above. I consider the second case to be
action-centered rather than information-centered because it is much closer to a nonverbal
action: P2 must not only evaluate his or her obligation to P3 vis-a-vis the information (P3's
right to know) but P2 must also evaluate his or her obligation to P1 to be the conveyor of the
information (P1's right to request). This level of complexity is closer to action-centered than to
information-centered speech acts. Note that my argument here is based on Guideline 3. I
am appealing to larger patterns of action to draw speech act boundaries, which then
percolate up into class boundaries. A case can also be made based on Guideline 2: the
simple interrogative form may be used for directives involving verbal actions with P1 as the
destination, but it may not be used for verbal actions with a third party as destination.

Taking each of the three example speech acts in turn, I start with OFFER. Although
promising is the favorite commissive in the speech act literature, OFFER will be considered
here because it has a rich ISA structure. It is also a speech act that is, in a sense, doubly
action-centered. An OFFER conveys, first, P1's desire to perform a service for P2 or to give,
lend, etc. something to P2. Second, OFFER conveys P1's desire that P2 accept the service,
gift, loan, or whatever. The preconditions given for OFFER in Figure 3.2 clearly reflect
this split.

The appearance of the "be helpful" obligation in OFFER (VII) needs explaining.
The concept that I am trying to express is P2's obligation to accept P1's OFFER. At first
glance, it seems enough to say that P2 has a general obligation to act in his or her own
self-interest (precondition VIII). This obligation, coupled with the fact that the propositional
content of an OFFER is an action expected to benefit P2, gives rise to an obligation for P2
to accept the OFFER. There seems to be more than simple pragmatism, however, involved
in accepting OFFERs. The obligation to accept is not merely P2's obligation to further his
or her own goals, but also P2's obligation to P1 to further P1's goals (i.e., P2's obligation to
be helpful, precondition VII). In accepting an OFFER, P2 is enhancing P1's image as a
benevolent person, P1's satisfaction in giving, etc. By accepting, then, P2 is furthering P1's
goals and being "helpful."

The next action-centered speech act that we come to is SUGGEST (Figure 3.3).
Recall the distinction above that SUGGEST is a directive that P2 consider an action, not
necessarily that P2 carry it out. Note that this treatment of SUGGEST is supported by the
common use of a response such as 3.3.

3.3 That's a good idea.

Figure 3.2. The preconditions for OFFER

propositional content: some action that P1 believes will be of benefit to P2

I. P1 wants to take responsibility for the action.

II. P1 believes that P1 can take responsibility for P1's part of the action.

III. P1 is willing to take responsibility for P1's part of the action.

IV. P1 wants P2 to perform some action that complements P1's part of the action.

V. P1 believes P2 can perform some action that complements P1's part of the
action.

VI. P1 believes that P2 would be willing to perform some action that complements
P1's part of the action.

VII. P1 believes that P2 has an obligation (to "be helpful") to P1 to perform some
action that complements P1's part of the action.

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest)
to perform some action that complements P1's part of the action.

Comments

(I)-(III) "Take responsibility" is used here and elsewhere to permit subcontracting. Whether
P1 actually performs the action or not, he or she still remains responsible for the results.

(IV)-(VIII) Examples of complementary actions would be (physically) taking food OFFERed
by a hostess or getting into a car and sitting in response to an OFFER of a ride from a
friend. A general way to refer to P2's performance of a complementary action in response to
an OFFERed action is to say that P2 accepted, e.g., "Jane thanked Paula and accepted the
gift."

Figure 3.3. The preconditions for SUGGEST

propositional content: an action
except for: actions in which P1 and P2 share joint agency

I. P1 wants P2 to consider taking responsibility for the action.

II. P1 believes that P2 can consider taking responsibility for the action.

III. P1 believes that P2 is willing to consider taking responsibility for the action.

IV. P1 believes that P2 has an obligation (to "be helpful") to P1 to consider the
action.

V. P1 believes that P2 can take responsibility for the action.

VI. P1 believes that P2 is willing to take responsibility for the action.

VII. P1 believes that there are some reasons why the action is desirable.

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest)
to consider taking responsibility for the action.

Comments

(propositional content) In excluding actions where P1 and P2 share common agency, I am
merely arguing for a separate speech act, e.g., SUGGEST-COMMON-ACTION, to cover
such cases. I justify this distinction by an appeal to Guideline 2: this second speech act has
some forms not shared by simple SUGGEST, e.g., "Let's <action>."

Note that the action to be considered by P2 may be a verbal action directed at P1. For
example, to help P2 solve a problem, P1 might SUGGEST 3.4.

3.4 Why don't you tell me what you've done so far.

The unrestricted nature of the action suggested does not, however, threaten the status of
SUGGEST as an action-centered speech act. This is because the action in the propositional
content is that P2 consider the second action, and considering something is a nonverbal
action.

The obligation preconditions for SUGGEST (IV and VIII) are similar to those for
OFFER. Once P2 has acknowledged that an action is desirable (precondition VII), there is
a default obligation that P2 consider taking responsibility for carrying out the action (VIII).
SUGGEST also appeals to P2's obligation to P1 to help further P1's goals (e.g., enhanced
self-esteem and recognition for P1). The obligation arises from the fact that a goal of P1 is
involved in a SUGGEST (precondition I). This obligation is generally not as prominent for
SUGGEST acts as for OFFERs, but it still plays a role. Consider, for example, the
politeness constraints involved in responding to a SUGGEST. A response such as 3.5
violates the obligation in (IV).

3.5 That's a terrible idea. 

The third action-centered speech act that will be considered here is REQUEST, the
directive by which P1 attempts to get P2 to take responsibility for carrying out some
non-verbal action or verbal action not directed at P1. One other class of actions that I am
currently excluding from REQUESTs is remembering previous REQUESTs; that is to say,
REQUEST as I define it does not include reminders. This can be justified by an appeal to
Guideline 1, due to the difference in SAVs request and remind, as well as by an appeal to
Guideline 3. (Reminders are often followed by apologies or thanks by P2, where simple
REQUESTs are not.) Figure 3.4 lists the preconditions for REQUEST.

In defining REQUEST, I follow Labov and Fanshel in citing an obligation. Just as
for ASK, I further specialize the notion of obligation into authority, complementary role,
and "be helpful" obligations. 20 Another point that is worth mentioning is that my treatment
of obligation subsumes the notion of P1's right to invoke the obligation. (See [14], p. 78.) The
obligation stated for REQUEST and ASK is a three-place relationship between P1, P2, and
the thing that P2 is obliged to do. P1 may be part of a larger set, so that the particular
obligation may be to both P1 and the society in general, e.g., the obligation to drive
carefully. Given this formulation, P1 has the right to invoke the obligation because P1 is
one of the parties to the obligation.

This takes care of the speech act examples that will be used in this paper. There is
one more piece of foundation that must go into place before the taxonomy of ISAs can be
presented. Accordingly, the next section presents a general representation scheme for
actions.

20. For REQUEST and, to a lesser degree, for ASK, I have been tempted to define three
different speech acts, one for each type of obligation. While there is some Guideline 2
(difference in surface form) evidence for this, the bulk of the argument would have to rest
on Guideline 3 (action pattern) evidence. The modelling of dialogue patterns is at too early
a stage for Guideline 3 evidence taken alone to be conclusive, so I stay with the more
standard treatment of REQUEST.

Figure 3.4. The preconditions for REQUEST

propositional content: a non-verbal action or a verbal action whose destination is not P1,
except for: the action of remembering a previous REQUEST

I. P1 wants P2 to take responsibility for carrying out the action.

II. P1 believes that P2 can take responsibility for carrying out the action.

III. P1 believes that P2 is willing to take responsibility for carrying out the action.

IV. P1 believes that P2 is obligated to P1 (and possibly to others) to take
responsibility for carrying out the action. (This obligation may be a role obligation,
an authority obligation, or a general obligation to be helpful.)

Comments

(I) "Want" implies that P1 believes that the action has not already been carried out.

(III) At least for the purposes of reasoning, "willing" here includes P2 not being grossly 
unwilling. 

(IV) More than one of these different obligations may apply at once. 
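
The similarity between the preconditions in Figures 3.1 and 3.4 can be made concrete with a small sketch. The Python below is an illustration only: the function `directive_core` and the idea of a shared template are assumptions introduced here, the wording paraphrases the figures, and the order of the ASK list does not follow the numbering of Figure 3.1.

```python
from typing import List


def directive_core(act: str) -> List[str]:
    """Preconditions shared by the two directives, for a desired act by P2.

    The template is a hypothetical reading of Figures 3.1 and 3.4,
    not a definition given in the paper.
    """
    return [
        f"P1 wants P2 to {act}.",
        f"P1 believes that P2 can {act}.",
        f"P1 believes that P2 is willing to {act}.",
        f"P1 believes that P2 has some obligation to P1 to {act}.",
    ]


# REQUEST: the desired act is taking responsibility for an action.
REQUEST = directive_core("take responsibility for carrying out the action")

# ASK: the desired act is telling an answer, plus P1's own wish to know it.
ASK = ["P1 wants to know an answer to the question."] + directive_core(
    "tell P1 an answer to the question"
)

for name, pres in (("REQUEST", REQUEST), ("ASK", ASK)):
    print(name)
    for pre in pres:
        print("  " + pre)
```

On this reading, ASK is not derived from REQUEST. Both instantiate the same directive template, which is one way to picture their common membership in the directive class.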
4. Representing Procedures: The OWL-I Method

In this section I discuss a general representation scheme for actions and patterns of
actions. The discussion will add new perspectives to the treatment of taxonomy developed
in the last two sections. More important for my purposes here, this section will also form 
the foundation of the analysis of ISAs that is presented in Sections 6 and 7. Note that the 
scheme for representing actions described here is not the only one that has been proposed. 
I defer a discussion of some of the alternatives until Section 8.2, where a comparison of 
static representations can be combined with a look at the way that the representations are 
used in language processing. 

I start the discussion of patterns of action with OWL-I, which is a formalism for
representing knowledge. The basic unit of OWL-I is called a concept, and concepts are
related to each other by a network of indices. The important point about concepts for this
paper is that they can be grouped into larger structures. The structure of interest here is
called a method. 21 Methods provide a high level, hence relatively declarative, representation
for procedural knowledge. The high level nature of the representation means that methods
can be used as the basis of an explanation facility (see Swartout [32]). Moreover, this
property in combination with the highly interrelated nature of OWL-I concepts means that
method structures can be a very important aid in searches of a knowledge base. My
discussion of methods will avoid issues of notation. English translations are used for
examples, and the reader interested in OWL-I notation is referred to Hawkinson [13] and
Brown [3].

21. OWL methods were initially defined by William A. Martin. OWL-I is the version of OWL
that was used in the dialogue project; OWL has continued to develop since that time, and
OWL-II is the current version. For an introduction to OWL-I, see Szolovits et al. [33].

With respect to content, methods are used to represent both linguistic and non- 
linguistic actions. For example, they can be used to represent an action such as writing a 
computer program for someone else, which normally contains both linguistic and non- 
linguistic steps. Methods can also be used to model totally non-linguistic activities such as 
baking a cake. 22 In the examples I have considered and in the rest of this paper, however, I
will focus on what will be called core dialogue methods. This is a group of semantic domain
independent methods built around speech acts. (Core methods may, however, contain some
non-linguistic steps, e.g., to represent mental processes.) An example of a core method is
request-and-respond, which appears in an English translation in Figure 4.1. (The details of 
this example will be explained below.) English translations of some other core methods can 
be found in Appendix A. 

Implicit in the preceding discussion is the point that methods may have one
participant or several. The core dialogue methods model the parts played by both
participants in the dialogue, a property I call speaker independence. Methods can therefore 
be used either from the point of view of a non-participant to model both parts heard (or 
read) or from the point of view of a participant, in which some utterances are generated 
and some understood. 
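
To illustrate speaker independence, the request-and-respond method of Figure 4.1 can be sketched as a single procedure that records the steps of both participants. The Python below is a simplification under stated assumptions, not the OWL-I method: the class `CoAgent`, its fields `roles` and `referral`, and the function name are hypothetical, the role test stands in for step 3, a known referral stands in for the CO-AGENT wanting to be helpful, and recovery path 2 is omitted.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class CoAgent:
    name: str
    roles: Set[str]                  # actions within the CO-AGENT's role (assumption)
    referral: Optional[str] = None   # a likely participant to refer the AGENT to


def request_and_respond(agent: str, co: CoAgent, action: str) -> List[str]:
    """Return the steps taken by both participants, in order."""
    log = [
        f"{agent} REQUESTs '{action}' of {co.name}",           # step 1
        f"{co.name} now knows the action desired",             # step 2
        f"{co.name} checks the beliefs behind the REQUEST",    # step 3
    ]
    if action in co.roles:
        log += [
            f"{co.name} acknowledges the REQUEST",                  # step 4
            f"{co.name} takes responsibility for '{action}'",       # step 5
            "PRINCIPAL-RESULT: results of the action obtain",       # step 6
        ]
        return log
    # Recovery path 1: the action is not in one of the CO-AGENT's roles.
    log.append(f"{co.name} says the action is not in his or her role")   # R1.1
    if co.referral is not None:                                            # R1.2
        log.append(f"{co.name} refers {agent} to {co.referral}")
    return log


co = CoAgent("B", {"open the file"}, referral="C")
for step in request_and_respond("A", co, "open the file"):
    print(step)
```

Because the log contains the contributions of both AGENT and CO-AGENT, the same procedure could in principle be read by an observer of the dialogue or by either participant, with some steps generated and others understood.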

Looking at the structure of methods in more detail, we start with the three main parts: 
a header, argument specifications, and a procedural body. The header is the method's 



22. See Long [17] for the use of OWL-I methods to model programming knowledge.

REQUEST-AND-RESPOND

OBJECT: an action that can be the object of REQUEST
    exceptions: 1. a helping action 2. a repetitive action
AGENT: a person or computer system
CO-AGENT: a person or computer system

method:

1. The AGENT REQUESTS the action of the CO-AGENT.

2. The CO-AGENT now knows the action desired.

3. The CO-AGENT checks to see whether beliefs from the preconditions
   of the REQUEST are justified.

4. The CO-AGENT acknowledges the REQUEST.

5. The CO-AGENT, at the appropriate time, takes responsibility for
   carrying out the action.

6. PRINCIPAL-RESULT: Any results of the action desired obtain.

recovery path 1: if the CO-AGENT can't do what the AGENT says --
    If the REQUEST is based on a role obligation and the action
    REQUESTed is not in one of the CO-AGENT's roles with respect to
    the AGENT --

    R1.1 The CO-AGENT says the action is not in his or her role.
    R1.2 If the CO-AGENT wants to be helpful
         and
         if the CO-AGENT knows a likely participant to take
         responsibility for the action desired
         then
         the CO-AGENT refers the AGENT to this likely participant.

recovery path 2: if the REQUEST was framed in a general way
    and the CO-AGENT can only do a more specific version --

    R2.1 The CO-AGENT describes the specific version that he or she
         can do.
    R2.2 If the more specific description matches the AGENT's goal --
         then
         the AGENT says that is what he or she wanted
         and
         REQUEST-AND-RESPOND continues using the new description
         otherwise
         dialogue failure: the CO-AGENT can't do what the AGENT wants.

Figure 4.1. An English representation of a method for REQUESTing an action
and getting a response.

unique name. Argument specifications, organized by semantic cases (see below), are used for
type checking of inputs to the method (input tests) or to specify the form of results (output
cases). The procedural body is divided into two parts: (optional) prerequisites and
procedure steps. The prerequisites of interest here are those that are states. A stative
prerequisite of an action is a condition that must obtain before that action is carried out. If
the condition does not hold, then one must bring it about before carrying out the action.

The other part of the procedural body is the set of procedure steps, which come in 
two basic varieties, standard path and recovery path. This distinction will be discussed
further below, but, basically, standard paths represent the ways that an exchange can "go 
right," while recovery paths give some of the possible measures to be taken when an 
exchange gets "off the track." A recovery path may be initiated when an expectation set up 
by an ongoing method is violated. 
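
The parts of a method described so far (header, argument specifications, prerequisites, and the two kinds of procedure steps) can be pictured as a simple record. The Python sketch below is only an illustration and is not OWL-I notation: the class and field names (Method, RecoveryPath, input_cases, and so on) are assumptions introduced here, and the step texts are abbreviated from Figure 4.1.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class RecoveryPath:
    """A path taken when an expectation set up by the method is violated."""
    condition: str          # informal description of the triggering failure
    steps: List[str]        # informal descriptions of the recovery steps


@dataclass
class Method:
    """Hypothetical record mirroring the parts of a method named in the text."""
    header: str                                   # the method's unique name
    input_cases: Dict[str, str]                   # semantic case -> input test
    output_cases: Dict[str, str]                  # semantic case -> form of result
    prerequisites: List[str] = field(default_factory=list)   # optional
    standard_path: List[str] = field(default_factory=list)
    recovery_paths: List[RecoveryPath] = field(default_factory=list)


# request-and-respond, transcribed loosely from Figure 4.1.
request_and_respond = Method(
    header="request-and-respond",
    input_cases={
        "OBJECT": "an action that can be the object of REQUEST",
        "AGENT": "a person or computer system",
        "CO-AGENT": "a person or computer system",
    },
    output_cases={"PRINCIPAL-RESULT": "any results of the action desired"},
    standard_path=[
        "The AGENT REQUESTS the action of the CO-AGENT.",
        "The CO-AGENT now knows the action desired.",
        "The CO-AGENT checks the preconditions of the REQUEST.",
        "The CO-AGENT acknowledges the REQUEST.",
        "The CO-AGENT takes responsibility for carrying out the action.",
        "Any results of the action desired obtain.",
    ],
    recovery_paths=[
        RecoveryPath(
            condition="the CO-AGENT can't do what the AGENT says",
            steps=["The CO-AGENT says the action is not in his or her role.",
                   "The CO-AGENT refers the AGENT to a likely participant."],
        ),
        RecoveryPath(
            condition="the REQUEST was framed in a general way",
            steps=["The CO-AGENT describes the specific version he or she can do.",
                   "The AGENT confirms, or the dialogue fails."],
        ),
    ],
)
```

Keeping the recovery paths on the method itself, rather than in a global table, reflects the point that they are specific to the task the method models.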

Since REQUEST is the speech act from which I will draw most of the examples, I
will use the request-and-respond pattern to illustrate the discussion of methods. In Figure
4.1, request-and-respond corresponds to the header of the actual OWL-I method.
Request-and-respond takes three input cases - OBJECT, AGENT, and CO-AGENT - and one
output case, PRINCIPAL-RESULT. 23 AGENT and CO-AGENT are the two participants
in the method; by convention, the AGENT of an entire dialogue method is the AGENT of 
the first step, so that we can identify the AGENT of the method with the participant who 
starts it off. The OBJECT of request-and-respond is the action desired by the AGENT. 



23. Semantic case names will be capitalized to avoid confusion with the normal use of these 
words. There should be no confusion with speech act names, which I am also capitalizing.

24 Input case specifications typically contain variables, which are like other concepts except
that their composition follows special rules. The fact that other concepts may be bound to 
variables permits representations to be evaluated with respect to some environment. (This is 
important when methods are used in recognition; see Section 9.) One property of input case 
specifications omitted from Figure 4.1 is their use as a place to associate further entry 
conditions, i.e., conditions that must be met before the method is applied. This point is 
discussed further below. 

Note that input cases are associated with methods, not surface English verbs. A list of
the major OWL-I input cases, with informal explanations, was written by William A. Martin
and is reproduced with minor changes in Figure 4.2. I refer to these as the major cases
because I do not see Figure 4.2 as an exhaustive list. There will be a significant number of
inputs to actions that this list does not cover. Many of these can be fitted into the basic
framework as more specialized versions of the major semantic cases. It appears, however,
that some actions have totally idiosyncratic inputs which will have to be treated as special
cases (in both senses of case).

Besides input case specifications, I said that OWL-I methods may have associated
output case specifications, i.e. results. One important notion here is that of principal result, 
which is the main result of the method and, typically, the reason that the method is 



24. The OBJECT of request-and-respond excludes helping actions, i.e., those in which the
AGENT and CO-AGENT divide responsibility for the action. These are represented in
another core method, request-and-help (see Appendix A). Also excluded are repetitive
REQUESTs, e.g., a standing order for publications. More effort must be put into
bookkeeping so that the repetitive REQUEST can be responded to repeatedly. The steps
differ enough to warrant the use of a separate method.

Figure 4.2. The major OWL-I input cases



OBJECT

This is the input to the method that is most important to the identity of the process.
For example, the process of baking bread is similar no matter who does it, but baking a
souffle is a very different process from baking bread.
ex. Make a [illegible] from clay.
    Empty the boat of water.

AGENT

The AGENT is the participant responsible for the action.
ex. I hit the ball.

CO-AGENT

This is someone who has a responsibility level equal to that of the AGENT for some
part of the action.
ex. Harry helped me move the couch.

BENEFICIARY

This is an individual who receives the benefit of the action without major active
participation in it.
ex. Answer the phone for John.

SOURCE

The SOURCE describes a former position or state of some other case-filler in the
action (usually the OBJECT).
ex. Take a block from the box.

DESTINATION

The DESTINATION is to the future as the SOURCE is to the past. It is a point, object,
etc. toward which some other case-filler in the action (usually the OBJECT) is tending.
ex. Tell it to the judge.

TRAJECTORY

This is the path taken by some other case-filler in the action.
ex. Run across the street to the store.

DIRECTION

This is the direction taken by some other case-filler in the action.
ex. Head south across the desert to Mexico.

SPECIFIC-LOCATION

This is a location associated with the action. Some activities like "ride" take a
location which is quite specific to it; there are only certain things which can be ridden in.
ex. Ride in a dog sled in Alaska.

INSTRUMENT

The INSTRUMENT is something used as a tool in an action; it is left over
afterward.
ex. Cut the butter with a knife.

SPECIFIC-RAW-MATERIAL

This is something used and consumed in the course of the action.
ex. Bake a cake with powdered eggs.

TOPIC

This case is used for the topic of mental and communication activities.
ex. Talk about a bear.

EXCHANGE

This is something received in exchange for another case-filler in the action.
ex. Trade seeds for food.

RATE

This case gives a measure of speed, cost, etc. in relation to units or to something else.
ex. Rent a room at three dollars a day.


Figure 4.2 (cont.). The major OWL-I input cases

undertaken. For example, the action conveyed in "Paint the block red." has as principal
result that the block is red. The paint brush may also end up red, but this is not the
principal result. In Figure 4.1 request-and-respond has its PRINCIPAL-RESULT output
case set to the results of the action desired by the AGENT. When the method is
successfully completed, the action has occurred and its results obtain.

This takes care of the header and argument specifications; we turn now to the 
procedural body, the first component of which is the prerequisite. The request-and-respond 
example has no prerequisite. In general, none have been used in the representation of core 
dialogue methods. Nevertheless, prerequisites are a useful structure for other sorts of actions. 
A familiar example of a prerequisite is the requirement that an elementary course of study 
be completed before a more advanced one is undertaken. Note that prerequisites differ 
from other entry conditions in that, if a prerequisite is not satisfied, the method is not rejected
as a possible course of action. Instead, measures are taken to satisfy the prerequisite; the 
execution of the method then continues. 

The second part of the procedural body is the set of procedure steps. These are 
either calls to other OWL-I methods used to model subactions, assertions of states, or calls to
"black box" procedures. The last are procedures represented in a conventional
programming language (e.g., LISP); these are intended for complex numeric or linguistic
computations, or for any procedures whose steps are not to be included explicitly in the 
model. (We will see a use of "black box" representations below.) 

Recall that procedure steps were divided into standard path and recovery path
steps. 25 I will say something more about each. First, standard paths are the normally
expected sequences of events for an action. They embody the relatively small number of
ways that an action can be concluded successfully, i.e., can be concluded such that any
associated goals are met. When an English description of an action is given, it is the
standard path steps that are typically included. If deviations from the standard path(s) are
described at all, it is only the most important ones that are given. This indicates that the
steps in the standard paths of an action embody the minimum of necessary information
about it. For example, when giving someone directions for getting somewhere one normally
describes only the successful routes. In the normal case, one does not discuss all the many
ways someone could get back on the route again after having made a wrong turn. Even
when a particular mistake is very common or particularly costly, one usually describes how
to avoid it (e.g., "Don't follow the signs") rather than how to recover from it.

To decide which steps belong to standard paths, one can ask whether it would
normally be necessary to describe a step when describing the action, or whether it would be
necessary to describe some step of which it is a substep. (This second case applies to
low-level detail left out because it is either obvious or unnecessary.) Another useful criterion is
whether the absence of a step is seen as an event. (This criterion is similar to the one used
by Schegloff and Sacks in [28] to determine what should go in a sequence.) For example, if 
someone asks a question and does not get an answer, this is generally worthy of note. If one 
asks a question and the hearer does not ask for clarification, this absence of discussion is 



25. This is not quite the whole story. A third type of step is the assertion of a failure
condition. Failure conditions are represented as disjuncts of standard paths or recovery
paths.

not, in most cases, considered remarkable. The standard path steps, then, are the normally
expected sequences of events for an action, those whose non-occurrence is normally
considered an event.

It is fine to follow standard path steps as long as the action goes as intended and no
expectations are violated. In practice this will probably not be long, and a failure, with its
attendant recovery actions, will result. 26 Some recovery actions are very general and should
be modelled by independent methods. One type of general example is the recovery from a
failure in the basic conditions of a dialogue, e.g., noise on the line in a telephone
conversation. Many other failures, however, are quite specific to the task being carried out.
Recovery from such failures is modelled by recovery paths in relevant methods. Recovery
paths handle domain-specific types of failure recovery, and one of their advantages is that
they allow the modeller to put in special instructions for doing what is necessary to return to
an appropriate standard path.
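
The control flow implied here (follow the standard path, treat a violated expectation as a failure, let an applicable recovery path repair the situation, then return to the standard path) can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the OWL-I interpreter: the function run_method, the dictionary-based state, and the predicate and repair functions are all hypothetical names introduced here, and the scenario is loosely modelled on recovery path 2 of Figure 4.1.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

State = Dict[str, object]


@dataclass
class RecoveryPath:
    name: str
    applies: Callable[[State], bool]   # does this path fit the failure?
    repair: Callable[[State], None]    # special instructions to get back on track


def run_method(standard_path: List[Tuple[str, Callable[[State], bool]]],
               recovery_paths: List[RecoveryPath],
               state: State) -> List[str]:
    """Follow the standard path; on a violated expectation, try one recovery path."""
    trace: List[str] = []
    for step, expectation in standard_path:
        if not expectation(state):                      # failure = violated expectation
            path = next((p for p in recovery_paths if p.applies(state)), None)
            if path is None:
                trace.append("dialogue failure at: " + step)
                return trace
            trace.append("recovery: " + path.name)
            path.repair(state)
            if not expectation(state):                  # the repair did not help
                trace.append("dialogue failure at: " + step)
                return trace
        trace.append(step)                              # back on the standard path
    return trace


def co_agent_can_do(state: State) -> bool:
    return state["request"] in state["co_agent_can_do"]


def narrow_request(state: State) -> None:
    # The AGENT accepts the more specific description as what was wanted.
    state["request"] = state["agent_goal"]


STANDARD_PATH = [
    ("AGENT REQUESTS the action", lambda s: True),
    ("CO-AGENT checks preconditions", co_agent_can_do),
    ("CO-AGENT acknowledges the REQUEST", lambda s: True),
]

RECOVERY_PATHS = [
    RecoveryPath(
        name="describe a more specific version",
        applies=lambda s: s["agent_goal"] in s["co_agent_can_do"],
        repair=narrow_request,
    ),
]

state = {
    "request": "a program",
    "agent_goal": "a blocks world program",
    "co_agent_can_do": {"a blocks world program"},
}
trace = run_method(STANDARD_PATH, RECOVERY_PATHS, state)
```

The sketch tries at most one recovery path per failure and omits the very general recovery methods (such as those for noise on the line), which the text treats as independent methods.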

The standard path steps for the request-and-respond method are steps 1 through 6 in
Figure 4.1. Two recovery paths are also shown for request-and-respond, although many
more exist to be modelled. The first recovery path in Figure 4.1 could be used to account
for a response such as 4.1, and the second could account for the exchange in 4.2 responding
to a REQUEST to write a computer program.

4.1 That's not my department. Try Mr. Jones down the hall.

4.2 CO-AGENT: I can only write blocks world programs.
AGENT: That's what I want.



26. For my purposes here, I define failure as the violation of an expectation. This is
slightly broader than its normal usage, and I do not intend for the usual negative
connotations to accompany it.

The failures that trigger both of these recovery paths occur in the process of checking the
preconditions, step 3, but the recovery paths are associated with the request-and-respond
method. Different recovery paths for these failures could be used when a
precondition-checking method is called from another method, e.g., request-and-help. Note that the second
recovery path itself has an associated failure condition, which would justify the inclusion of
an additional recovery path in the appropriate method. Although only the recovery paths
in Figure 4.1 contain conditional steps, note that such conditionals can occur in standard
paths as well.

One final point that Figure 4.1 illustrates about dialogue methods is their abbreviated
form. For any given speech act, only production is represented explicitly, and the
understanding process carried out by the other partner is left implicit. For example, the
first step of request-and-respond is that the AGENT makes the REQUEST, but no explicit
step is given to represent the process of the CO-AGENT understanding the REQUEST.
Thus, although the methods are speaker independent in the technical sense defined, they
are not without a bias: it does not matter who is specified as the AGENT of a speech act
step, but, whoever is, the "story" is told from his point of view. "Listening" steps are left
implicit not because they are unimportant, but because the form and timing are predictable.
Where a joint model of communication is necessary (e.g., when misunderstandings occur) it
is necessary to expand the abbreviated model expressed in the dialogue methods.

This takes care of the request-and-respond example and with it the general
description of methods. A few words remain to be said to link methods to the treatment of
speech acts described in Section 2. First, note that speech acts such as REQUEST have
their own associated methods. This is in accord with the major thrust of speech act theory,
which views utterances not merely as saying but also as doing. The speech act methods
have semantic case specifications for an OBJECT, AGENT, and DESTINATION. 27 Here,
the OBJECT specification generally corresponds to the propositional content condition
identified by Searle. For all speech act methods, the procedure steps are represented by a
call to a "black box" procedure for generating utterances. The preconditions of speech acts
are represented in the methods as entry conditions associated with the AGENT of the
speech act. Preconditions also appear, possibly somewhat restricted, as constraints on the
AGENT of the higher level core methods built around the speech acts.

Turning to speech act taxonomy, recall that I adopted the principal intended effect of 
the speech act as a major taxonomizing principle. This notion of intended effect can be 
related to core methods via the OWL-I output case PRINCIPAL-RESULT. More
specifically, the principal intended effect of a speech act corresponds to the action directly
precipitating the PRINCIPAL-RESULT(s) of a core method built around the speech act. 28
For the request-and-respond example, the principal intended effect is step 5, whose results
become the PRINCIPAL-RESULT of the core method. 29 We can say, then, that core



27. DESTINATION is used rather than CO-AGENT because the second participant in a 
speech act is seen as passive. This does not mean that this participant does nothing, since
he or she is doing the work of trying to understand the utterance. By passive I merely mean 
that the second participant is basically receiving the information, rather than playing a 
major role in shaping it. 

28. Where a speech act has more than one associated core method, the principal intended 
effect will be a generalization that covers the different actions that directly precipitate the
PRINCIPAL-RESULT(s). 

29. Note that principal intended effects are often, but not always, the last step in a core
method. For example, we can construct a core method called state-and-acknowledge,

methods accommodate the taxonomizing principle used for speech acts in Section 2.



*** 



In summary, methods provide a general representation for actions. In this section, I
have emphasized the way that speech acts can be integrated into larger patterns of action, 
using methods as a representation. In Section 7, method structure will be used in 
formulating an important class of rules for ISAs. 



consisting basically of a speech act of stating (done by P1) followed by an understanding
and acceptance operation (done by P2). This latter step would be the principal intended
effect of the speech act, but it is not customarily the last step in the core method. Instead, the
last step of state-and-acknowledge is typically an acknowledgment (done by P2).

5. Three Approaches to the Treatment of Indirect Speech Acts

All of the foundation is in place, and we can now turn our attention to ISAs. To 
start, I briefly outline three approaches to the treatment of ISAs that have been prominent
in the linguistics literature. Too much has been written on this topic for me to attempt a 
comprehensive summary here, and I intend only to use these approaches to motivate the 
analysis in Sections 6 and 7. 

5.1. Gordon and Lakoff 

The first approach of interest here is that taken by Gordon and Lakoff [10].
Concentrating primarily on requests, Gordon and Lakoff propose a set of four sincerity
conditions 30 and then give a single powerful rule to account for the different ways that a
request can be framed. They say that to make a sincere request, a speaker must want the 
action done and believe that the hearer can do the action, that he wants to do the action, 
and that he would not do it if he were not asked to. The first of these sincerity conditions 
is called speaker-based and the next three are called hearer-based. The rule given is: 



One can convey a request by (a) asserting a speaker-based sincerity condition or 
(b) questioning a hearer-based sincerity condition. 
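
Applied mechanically, the rule amounts to a simple generation procedure. The Python sketch below is offered only as an illustration of that reading: the SincerityCondition record, the function indirect_requests, and above all the English templates are assumptions introduced here, not Gordon and Lakoff's own formulations. The four entries follow the four conditions listed above (one speaker-based, three hearer-based).

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class SincerityCondition:
    based_on: str    # "speaker" or "hearer"
    assertion: str   # declarative template for the condition (illustrative wording)
    question: str    # interrogative template for the condition (illustrative wording)


REQUEST_CONDITIONS = [
    # The speaker wants the action done.
    SincerityCondition("speaker", "I want you to {action}.",
                       "Do I want you to {action}?"),
    # The hearer can do the action.
    SincerityCondition("hearer", "You can {action}.",
                       "Can you {action}?"),
    # The hearer wants (is willing) to do the action.
    SincerityCondition("hearer", "You are willing to {action}.",
                       "Are you willing to {action}?"),
    # The hearer would not do the action if not asked.
    SincerityCondition("hearer", "You will not {action} unless asked.",
                       "Will you {action}?"),
]


def indirect_requests(action: str) -> List[str]:
    """Assert speaker-based conditions; question hearer-based ones."""
    forms = []
    for condition in REQUEST_CONDITIONS:
        if condition.based_on == "speaker":
            forms.append(condition.assertion.format(action=action))
        else:
            forms.append(condition.question.format(action=action))
    return forms


forms = indirect_requests("move over")
```

Nothing in this procedure filters its output, so whatever can be phrased as an assertion or a question of a condition counts as a request form.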



This formulation is very attractive because it is so elegant and simple, but it has
encountered extensive criticism. Sadock [24] points out that Gordon and Lakoff's rule is



30. It is not clear whether Gordon and Lakoff mean something different by sincerity
condition than Searle does. At any rate, their choice of sincerity conditions for requesting
differs from Searle's.

too powerful, admitting utterances that are not requests. For example, 5.1 is not a request for
the hearer to move over, even though the similar form 5.2 is.

5.1 Tell me if you can move over. 

5.2 Can you move over? 

Even if the set of sincerity conditions could be recast to generate only the correct forms, 
there are still facts that cannot be accounted for by an approach that uses only speech act 
conditions plus a general rule. Sadock gives the following examples: 

5.3 Can you close the door? 

5.4 Are you able to close the door? 

5.5 Can you please close the door? 

5.6 *Are you able to please close the door?

Sentences 5.3 and 5.4 can both be used as requests, and in this usage they are paraphrases of
each other. Taking Gordon and Lakoff's approach, both 5.3 and 5.4 would therefore be
derived from the same sincerity condition. Sentence 5.6, however, is ungrammatical, and
any scheme that derives 5.3 and 5.4 uniformly will be hard-pressed to account for the
difference between 5.5 and 5.6.

5.2. Sadock 

To answer the objections that he raises against Gordon and Lakoff's theory, Sadock
in [24] distinguishes between utterances that have a speech act as their meaning and
utterances that mean one speech act but entail another. Thus, according to Sadock, 5.3 is a
sort of idiom which entails the notion of request as part of its meaning, while 5.4 is a
question according to its meaning, only secondarily entailing a request. Sadock's claim is
that, although rules based on sincerity conditions have their place, they cannot account for
the behavior found in speech act idioms.

Sadock's observations seem to be well taken, although his distinction between
meaning and entailment appears to me to be ultimately misleading. The difficulty, I think,
is that the notion of what a meaning is and what information it should contain are not as
well understood as the theory implies. Sadock assumes that each sentence has only one
semantic representation that constitutes its meaning, and he himself notes that this makes
him unable to account for the deliberate use of ambiguity in conversation.

There is other evidence that Sadock's treatment falls short of a precise
characterization of ISAs. Note that some sentences treated by Sadock as speech act idioms
have conversational properties based on their surface forms. Consider examples 5.7 and 5.8
used as requests.

5.7 Move over. 

5.8 Can you move over?

5.9 OK.

5.10 I don't know if I can.

5.11 Yes.

5.12 I don't know. 

Examples 5.9 through 5.12 are possible responses to 5.8, but only 5.9 and 5.10 are appropriate
as responses to 5.7. The indirect request form 5.8, then, can be treated as either a request or
a question by the hearer for the purpose of response. Note, however, that 5.8 can still be
understood as a request even though it is answered as a question. One may respond with
"Yes" and then go on to comply with the request. In fact, to answer "Yes" and then fail to
comply indicates either deliberate rudeness or failure to understand the request force of the
utterance. If we follow Sadock and treat 5.8 as an idiom when used as a request, then only
the request force, and not the question force, will appear in its semantic representation. It is
then impossible to account for the response behavior described.

Another phenomenon that Sadock's theory does not account for is the variation in the
indirect forms that are possible for "entailed" requests. For example, Sadock considers
questions of the "Are you able to ..." form to entail requests. He also observes that questions
of the form "Do you know <interrogative clause>?" can have the illocutionary force (and
hence in his scheme the meaning) of the interrogative clause. That is, 5.13 is often
equivalent to 5.14 rather than to a question about the knowledge of the hearer.

5.13 Do you know what time it is?

5.14 What time is it?

Putting these two facts together, Sadock's theory would predict that 5.15 should entail a
request, but it does not.

5.15 Do you know whether you are able to close the door?

Hence, even forces that Sadock considers entailments of utterances - and, therefore, totally
meaning-dependent - seem to have some dependence on surface form.

5.3. Searle 

A third approach that is of interest to us here is that of Searle in [30]. I start with a
discussion of some generalizations that he proposes and then compare his approach to that
of Sadock.

Searle presents four generalizations for directives and five others for commissives.
These play a role in Searle's theory analogous to Gordon and Lakoff's single rule, but
Searle's generalizations differ from theirs in the following important ways:

1. Searle's generalizations are differentiated according to the parts of the speech act,
i.e., propositional content conditions, sincerity conditions, and preparatory conditions.
Gordon and Lakoff's sincerity conditions, in contrast, seem to be an amalgam of
Searle's sincerity and preparatory conditions.

2. Searle explains an asymmetry in his generalizations by identifying the
conversational participant who has the better knowledge of a condition, but he does
not state this explicitly in the generalizations. For Gordon and Lakoff, the question
of which participant has the better knowledge is at the core of their single rule.

Searle's approach seems to me to be a valuable one, in that the generalizations are
more finely differentiated than Gordon and Lakoff's rule. At the same time, I think that
Searle's generalizations can be questioned on the count that they are too specific. Recall
that the notions of speaker- and hearer-based conditions have not been given an explicit
place in the generalizations. Because of this, Searle is forced to state the generalizations in
terms of specific types of preparatory conditions, for example, rather than in terms of
preparatory conditions in general. Although Searle is correct in looking for finer
differentiation than that afforded by Gordon and Lakoff's rule, it appears that his
generalizations are needlessly complex. See Section 6 for an alternative proposal.

Searle's generalizations are presented as part of a larger framework. It is instructive
to compare this framework with that developed by Sadock, since there are some sharp
contrasts. Searle makes the distinction between meaning and use, saying that an utterance
such as 5.16 should be seen as meaning a question while, at the same time, it may be used as
a request.

5.16 Can you pass the salt?

5.17 Are you able to pass the salt?

The fact that 5.16 takes "please" while 5.17 does not is accounted for by a maxim that one
should speak idiomatically in the normal case. Thus, Searle sees a request such as 5.16 as
"idiomatic but not an idiom." One attraction of this approach is that it accounts naturally (as
Searle points out) for the fact that responses to speech acts may correspond to surface form
as well as to a conveyed speech act.

There seem to be two major difficulties with Searle's approach. First, although he
acknowledges special grammatical behavior of some ISA forms, Searle is not willing to say
that such forms "mean" any speech act other than the one corresponding to the surface
form. Utterances such as example 5.5 ("Can you please..." forms) are treated as special
conventionalized forms, which is to say as very idiomatic (but still not idioms). If the "please"
in 5.5 is to be accounted for, this apparently must be done in terms of use, not meaning.
Thus, we have both use and meaning reflected in some surface forms, which blurs the
use/meaning distinction to some extent. To justify such a step, I think we need a more
developed account of the notion of "use" and a delineation of its impact on surface form.

Another difficulty with a strict meaning/use distinction appears in the variations
possible for a class of ISA forms that will be discussed in detail in Section 7.5. When 5.19 is
used in place of 5.18, I assume that Searle would call the statement that is conveyed the
meaning and the request that is conveyed the use. How, then, does 5.20 fit into this 
framework? 

5.18 Close the door. 

5.19 It's cold in here. 

5.20 Must I tell you that it's cold in here?

As I analyze it, example 5.20 can be used to convey three speech acts: first the "Must I tell
you...?" ASK, second the "It's cold in here" representative, and third the "Close the door"
REQUEST. Note that either response below would be acceptable:

5.21 Oh I'll close the door. 

5.22 All right I'll close the door. 

The following response would also be possible, although rude: 

5.23 Yes, you must. 

It is not clear how a strict meaning/use approach would account for the three levels of
representation involved here. Note that any change to the theory to accommodate "triple
message" ISAs will not be simple to make, due to restrictions on degree of indirection for
other ISA forms, as exemplified by 5.1 and 5.2 above.

It appears, then, that none of the ISA theories presented is totally adequate. All,
however, have extremely attractive aspects that are worth preserving. Gordon and Lakoff's
approach attempts to derive the ISA forms from general speech act conditions that have
independent uses. Sadock, while endorsing such general derivations, emphasizes the special
grammatical properties of some ISA forms. Although Searle's overall approach differs from
that of Gordon and Lakoff, his "generalizations" can be viewed as a more finely
differentiated version of Gordon and Lakoff's (too) simple principle for deriving ISAs. In
the approach described in the next sections I have preserved these positive aspects while,
hopefully, avoiding the negative ones.

6. Some General Rules for Indirect Speech Acts

One common property of the three theories discussed in the last section is the use of
general rules to account for ISAs. While the theories differ in the nature of individual rules
and the relative importance given to them, nevertheless general rules for ISAs are included
in each case. In light of the large number of ISA forms, such general rules appear to be
necessary. I follow Sadock, however, in questioning whether they are sufficient. In this
section I present a set of general rules for deriving ISAs from properties of speech acts.
These rules are intended as a basis only, and they will need to be both augmented and
restricted. Accordingly, in Section 7 I present another set of rules and categories that,
combined with the general rules from this section, give a more complete picture of the range
of ISAs possible. The reader is asked to keep in mind, then, that this section is intended as
only part of a theory, to be amended in Section 7.

The ISA rules presented in this section will be divided into three categories. The
categories are based on a common sense view of rational human behavior, expressed in
the following three maxims of action:

1. One should (only) initiate actions that are necessary. 

2. One should (only) initiate actions for which some desirable result or results can be 
expected. 

3. One should only initiate actions that one expects to be possible. 

These three maxims will be referred to as the maxims of Necessity, Desirability, and 
Possibility. 

Readers familiar with the classic work of Grice will recognize the approach that is 
being taken. There, Grice suggests four categories of 
maxims that are applicable to linguistic actions but which have analogues in other types of 
actions. Here, I am stating maxims applicable to actions in general but which apply to 
speech acts as a special case. The maxim of Necessity above has a counterpart in Grice's 
category of Quantity. The other two maxims have no direct counterparts, and they suggest 
extensions to Grice's framework. 

We will use the maxims of Necessity, Desirability, and Possibility to provide a 
conceptual organization for ISA rules. Accordingly, the next three subsections discuss rules 
of each type. Section 6.4 looks at some general behavior with respect to mood and tense. 
Finally, in Section 6.5 I discuss a fourth type of ISA that may call for a somewhat different 
approach. 

6.1. ISA Forms Related to the Maxim of Necessity 

The maxim of Necessity says that one should act when necessary, but one should 
avoid extraneous actions. When this second clause is applied to speech acts, it yields the 
following rule: 

Rule 6.1 

P1 can convey a speech act indirectly by - 

(i) ASKing whether the intended speech act is necessary 

e.g., the REQUEST "Do I need to tell you to shut the door?" 

(ii) ASKing whether an equivalent speech act (i.e., one with the same principal 
intended effect) has already been performed 

e.g., the REQUEST "Did anyone ask you to take out the garbage?" 

(iii) ASKing whether the principal intended effect can be expected to occur without 
the speech act 

e.g., the REQUESTS: 

"Are you planning to take out the garbage?" 
"Are you going to take out the garbage?" 

(iv) ASKing whether the principal intended effect of the speech act has already 
occurred 

e.g., the REQUESTS: 

"Did you take out the garbage?" 
"Have you taken out the garbage?" 

For Rule 6.1 and other rules presented in this paper, I am assuming a relatively constrained 
relationship between the actual representations of the rules and the semantic representations 
of individual utterances. See Appendix B for a description of this relationship. Although 
REQUEST examples were used in 6.1 (and will continue to be used in this section), the 
general ISA rules are intended to apply to any speech act where P2 can be expected to have 
the appropriate knowledge to respond to P1's ISA. 
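
As an illustration only, Rule 6.1 can be sketched as a small table of question templates, one per clause. Nothing in this sketch comes from the analysis itself: the `SpeechAct` record, its field names, and the fixed English templates are assumptions made for the example, and the templates are worded for REQUESTS only.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SpeechAct:
    """Hypothetical, minimal record of an intended REQUEST."""
    effect: str             # principal intended effect, e.g. "take out the garbage"
    effect_participle: str  # past-participle phrase, e.g. "taken out the garbage"


def rule_6_1(act: SpeechAct) -> dict:
    """Return one illustrative ASK form for each clause of Rule 6.1."""
    return {
        "i": f"Do I need to tell you to {act.effect}?",  # is the speech act necessary?
        "ii": f"Did anyone ask you to {act.effect}?",    # equivalent act already performed?
        "iii": f"Are you going to {act.effect}?",        # effect expected without the act?
        "iv": f"Have you {act.effect_participle}?",      # effect already occurred?
    }


garbage = SpeechAct("take out the garbage", "taken out the garbage")
for clause, form in rule_6_1(garbage).items():
    print(f"({clause}) {form}")
```

A real recognizer would work in the opposite direction, from surface form to intended speech act, and would match against semantic representations rather than strings. The table is meant only to make the four clauses concrete.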

In Rule 6.1, note that Clause (iv) is also expected to account for forms such as 6.1 used 
as a REQUEST to take out the garbage. 

6.1 Is the garbage out? 

This will require additional rules and distinctions, however, which will be introduced in 
Section 7.5. 

Earlier versions of this paper accounted for the "Will you <action>?" REQUEST form 
by an appeal to Clause (iii) of Rule 6.1. I have not been satisfied with this account, 
however, because some uses of the form are not motivated by questions of the necessity of 
the action. Consider, for example, 6.2. 

6.2 Will you accept a ride to the airport? 

This example can be accounted for by the maxim of Possibility. One can view 6.2 as P1 
asking P2 whether the outcome of an OFFER by P1 will be successful (i.e., acceptance). 
While some readings of the "will" form may be accounted for by Rule 6.1, we need a broader 
view of this form. See Section 6.3. 

Finally, note that P1 is permitted to use an ISA only when P1 can reasonably expect 
P2 to decipher P1's intent, i.e., to recognize the indirection. Neither Rule 6.1 nor any of the 
other rules presented here, however, includes this information. It appears that this 
constraint is part of a more general constraint that P1 avoid ambiguity. That is, P1 is 
obligated - to the best of his or her ability - to frame any utterance (ISA or not) in such a 
way that P2 can understand the message that P1 intended to convey. See Grice [10] for 
discussion of an "avoid ambiguity" maxim. 

6.2. ISA Forms Related to the Maxim of Desirability 

Next we come to the maxim of Desirability, restated here: 

One should (only) initiate actions for which some desirable result or results can be 
expected. 

Related to this maxim, we have the following general ISA rule: 

Rule 6.2 

P1 can convey a speech act indirectly by 

- a representative with the propositional content that some desirable result or results 
can be expected for some intended effect of the speech act. 

e.g., the REQUEST "I would be happier if you'd substantiate those figures." Here, 
the desirable result is the happiness of P1, and the intended effect of the REQUEST 
is that P2 substantiate the figures. 

Again, for Rule 6.2, although the example is a REQUEST, the rule is expected to apply to 
speech acts in general. 

In Rule 6.2, note that the intended effect need not be an immediate RESULT or 
PRINCIPAL-RESULT (see Section 4) of the speech act; it may be several times removed in 
the causal chain. Similarly, the desirable result need not be an immediate RESULT or 
PRINCIPAL-RESULT of the intended effect. Thus, we have the following examples of 
indirect representatives. 

6.3 You'll be happy to hear that Claudia won. 
(Hear refers to a RESULT of the speech act.) 

6.4 You'll be happy to learn that Claudia won. 
(Learn refers to the principal intended effect of the speech act.) 

6.5 You'll be happy to know that Claudia won. 
(Know refers to the PRINCIPAL-RESULT of the principal 
intended effect of the speech act.) 

Note that the desirable results from the maxim need not relate only to P1 and P2. 
Consider the following REQUEST example, taken from [14]: 

6.6 This room would look a lot better if you dusted it. 

Example 6.6 contains information about an inanimate object with the claim that it would be 
more attractive (to anyone) if the REQUESTed action were performed. Neither P1 nor P2 is 
explicitly mentioned in the statement of the result. 

6.3. ISA Forms Related to the Maxim of Possibility 

The third maxim proposed was the maxim of Possibility: one should only initiate 
actions that one expects to be possible. The minimal conditions for this maxim to obtain 
can be stated in terms of the framework developed in Section 4. A speech act can be 
considered to be possible if: 

1. the preconditions of its associated method can be satisfied. 

2. for an action-centered speech act, the semantic input cases of the action named can 
be filled. 

3. for an action-centered speech act, the prerequisites of the action named can be 
satisfied. 

The maxim of Possibility has a rich set of corresponding ISA rules. Rules related to (1) and 
(2) above are explored in the next two subsections. Rules related to (3) above are discussed 
in Section 7.5.2. 

6.3.1. Forms Based on Preconditions of Individual Speech Acts 

The first set of ISA forms based on the maxim of Possibility are those derived from 
preconditions of speech acts. Note that the preconditions associated with the speech acts 
give conditions particular to the speech act. There are other general conditions of speech acts 
which must obtain for the individual act to be well-formed, as evidenced by Rules 6.1 and 
6.2. I am not including these when I refer to preconditions. The approach taken in this 
subsection will be to distinguish three classes of precondition and formulate seven rules 
using the classes distinguished. 

While the rules suggested are intended to apply generally to speech acts, once again 
REQUEST will be used to supply examples. (Examples for OFFER, SUGGEST, and ASK 
are given in Appendix C.) To start, then, recall the preconditions identified in Section 3 for 
REQUESTS: 

I. P1 wants P2 to take responsibility for carrying out the action. 

II. P1 believes that P2 can take responsibility for carrying out the action. 

III. P1 believes that P2 is willing to take responsibility for carrying out the action. 

IV. P1 believes that P2 is obligated to P1 (and possibly to others) to take responsibility 
for carrying out the action. (This obligation may be a role obligation, an authority 
obligation, or a general obligation to be helpful.) 

The preconditions of REQUEST, and other speech act preconditions, can be divided 
according to which dialogue participant has the better knowledge of the conditions specified. 

1. P1-based preconditions. 

Here P1 has inherently better knowledge of whether or not the topic of the 
precondition holds. The topic of preconditions that begin here with "P1 believes" is 
considered to be the direct object of the initial "believes." For other preconditions (e.g., 
REQUEST I), the topic is the entire pattern. Preconditions that are P1-based include 
those whose topic is a goal of P1 (e.g., REQUEST I). 

2. P2-based preconditions. 

Here P2 has inherently better knowledge of whether or not the topic of the 
precondition holds. Preconditions that fit this category include P1's beliefs about P2's 
internal states. An example of a P2-based precondition is REQUEST (III). 

3. Unmarked preconditions. 

For these preconditions, determination about which participant has the better 
knowledge of the precondition depends on properties of the particular speech act 
and/or its context. Examples are REQUEST (II) and (IV). 

Using these precondition types, we can construct the following seven rules for ISA 
forms. 

P1 can convey a speech act indirectly by - 

Rule 6.3 (for P1-based preconditions): 31 

- a representative of the topic of a P1-based precondition of the speech act. 

e.g., I want you to water the plants. (REQUEST I.) 
I hope you will use common sense. (REQUEST I.) 

Rule 6.4 (for P2-based preconditions): 

- an ASK of the topic of a P2-based precondition of the speech act. 

e.g., Do you want to shut the door? (REQUEST III.) 

Rule 6.5 (for unmarked preconditions): 

- an ASK of the topic of an unmarked precondition of the speech act. 

This rule applies in a context where P1 believes P2 has better knowledge of the 
condition in the precondition topic. 

e.g., Is it your turn to do the dishes? (REQUEST IV.) 

31. It is probably the case that for all P1-based preconditions, the topic is equivalent to 
the entire precondition. This rule could probably be stated simply as "a 
representative of a P1-based precondition," but for now I stay with the more general 
version. 



Rule 6.6 (for unmarked preconditions): 

- a representative of the topic of an unmarked precondition of the speech act. 

This rule applies in a context where P1 believes P1 has better knowledge of the 
condition in the precondition topic. 

e.g., It's your turn to do the dishes. (REQUEST IV.) 

Rule 6.7 (for unmarked preconditions): 32 

- a representative of an unmarked precondition of the speech act where P1 
believes he or she has better knowledge of the condition. 

e.g., I believe it's your turn to do the dishes. (REQUEST IV.) 
I assume you are able to shut the door. (REQUEST II.) 



Rule 6.8 (for groups of preconditions): 

- a REQUEST form of an action that is a goal of P1 (i.e., A for "P1 wants A"). 

This rule is applicable only when the speech act has preconditions that are 
exact matches or specializations of the four preconditions of REQUEST. 

e.g., Take a cookie. (OFFER IV.-VII.) 

32. Utterances derived from this rule could also be treated as double indirections, e.g., an 
indirect form of a representative of the speech act. If this is done, however, these 
utterances would be special cases, since double indirections are not possible in general. 
(Recall example 5.1, "Tell me if you can move over," which is not a REQUEST that P2 
move.) Whatever the treatment of these utterances, note that the initial "I believe," "I 
assume," etc. functions more as a mitigator than as a separate speech act. For a discussion 
of mitigators, see [14]. 

Rule 6.9 (for groups of preconditions): 

- an ASK about whether P2 will take responsibility for carrying out an 
"active" action that is a goal of P1 (i.e., A for "P1 wants A"). 33 

This rule is applicable only when the speech act has preconditions that are 
exact matches or specializations of the four preconditions of REQUEST. 

e.g., Will you accept a ride to the airport? (OFFER IV.-VII.) 



Although these seven rules are more complex than Gordon and Lakoff's single rule, 
they give us a more powerful basis for identifying the set of precondition-based ISA forms. 
First, nothing corresponding to Rules 6.8 and 6.9 (which operate on groups of 
preconditions) is found in Gordon and Lakoff's theory. Second, the treatment of the 
precondition classes is more powerful here. Nothing in Gordon and Lakoff's dichotomy 
between speaker- and hearer-based sincerity conditions predicts conditions that can either 
be questioned or asserted, depending on context. 
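
The way Rules 6.3 through 6.6 select between a representative and an ASK can be sketched as follows. This is a minimal illustration under stated assumptions: the `Basis` and `Precondition` types, the `better_informed` argument, and the hand-written declarative and interrogative strings are all hypothetical, and Rules 6.7 to 6.9 are left out.

```python
from enum import Enum
from typing import NamedTuple, Optional


class Basis(Enum):
    P1 = "P1-based"
    P2 = "P2-based"
    UNMARKED = "unmarked"


class Precondition(NamedTuple):
    label: str      # e.g. "REQUEST IV"
    basis: Basis
    statement: str  # representative (declarative) form of the topic
    question: str   # ASK (interrogative) form of the topic


def indirect_form(pre: Precondition, better_informed: Optional[str] = None) -> str:
    """Choose a representative or an ASK of the precondition topic."""
    if pre.basis is Basis.P1:
        return pre.statement   # Rule 6.3
    if pre.basis is Basis.P2:
        return pre.question    # Rule 6.4
    # Unmarked precondition: the context says who has the better knowledge.
    if better_informed == "P2":
        return pre.question    # Rule 6.5
    if better_informed == "P1":
        return pre.statement   # Rule 6.6
    raise ValueError("an unmarked precondition needs better_informed='P1' or 'P2'")


turn = Precondition("REQUEST IV", Basis.UNMARKED,
                    "It's your turn to do the dishes.",
                    "Is it your turn to do the dishes?")
print(indirect_form(turn, better_informed="P2"))
print(indirect_form(turn, better_informed="P1"))
```

The point of the sketch is that the same unmarked precondition yields either a question or an assertion depending on context, which is the behavior the speaker/hearer dichotomy does not predict.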

The major difference between Rules 6.3 to 6.9 and Searle's generalizations rests in the 
level of generality of the two schemes. Recall that Searle's generalizations are more 
numerous because they are associated with different classes of speech acts. In contrast, the 
rules given here apply to speech acts in general, and they are proposed as the full set of 
general ISA rules that apply to preconditions. 

In terms of specific rules, Rules 6.8 and 6.9 differ most from Searle's generalizations, 
because, again, they operate on groups of preconditions. Rule 6.9 is of special interest. In 
Searle's scheme, the very common "Will you <action>?" form is derived from the 
propositional content condition of directives. I feel that this approach lacks semantic 
motivation, and so the approach taken here is to appeal to the maxim of Possibility. One 
can interpret a "will" question as a question about how P2 will respond to P1's speech act. 
The "will" ISA form can be seen as a hedge in case P2 cannot respond appropriately, does 
not wish to do so, or recognizes no obligation to do so. By using this form, P1 can carry out 
the intended speech act even though he or she may have some reservations about its effects. 
Anchoring "will" forms in groups of preconditions and appealing to the maxim of 
Possibility provides these forms with a stronger semantic motivation than the more 
structural account offered by basing rules on propositional content conditions. 

33. By "active" I mean an action that is viewed as an event in the environment 
shared by P1 and P2. For further discussion, see Section 6.4. 

Given the constraints on matching described in Appendix B, Rules 6.3 through 6.9 
are relatively restrictive. I will conclude this subsection with some observations on the 
forms that the rules do not account for. First, the rules as written do not account for 
differences in tense and mood. That is, Rule 6.5 generates example 6.7 but not 6.8 and 6.9. 

6.7 Are you able to drive Sarah to school? 

6.8 Will you be able to drive Sarah to school? 

6.9 Would you be able to drive Sarah to school? 

Since 6.8 and 6.9 are legitimate indirect REQUESTS, they must be accounted for; Section 
6.4 contains proposals for handling this tense and mood behavior. 

Another collection of forms not covered by Rules 6.3 to 6.9 consists of those that 
contain negations of the sort found in patterns 6.10 to 6.13: 

6.10 Won't you <action>? 

6.11 Couldn't you <action>? 

6.12 Shouldn't you <action>? 

6.13 Can't you <action>? 

Application of Rules 6.3 to 6.9 to the REQUEST preconditions does not yield such negated 
forms. In this case, I think that the exclusion is well-founded. Forms 6.10 to 6.13 can be 
interpreted as conveying several different speech acts, but two of the more prominent ones 
are begging and persuading. What these two speech acts have in common is P1's need to 
overcome some sort of resistance in P2. (The resistance is to the action named in the speech 
acts; both are action-centered.) REQUEST does not involve overcoming any special 
resistance; precondition (III) relates to P1's estimate of P2's openness to the action. Thus, 
while forms 6.10 to 6.13 are clearly similar to REQUEST forms, they are not treated here as 
REQUESTS. 

6.3.2. ISA Forms Based on Semantic Cases of Actions 

We continue with the maxim of Possibility and observe that ISA forms for some 
types of action-centered speech act are based on semantic input cases of the action named. 
Consider, for example: 

6.14 Do you have a rope to throw to that man? 
(OBJECT of the throwing action) 

6.15 Have you got a hammer to fix that picture? 
(INSTRUMENT of the fixing action) 

6.16 Do you have enough gas in your car to drive George to the airport? 
(SPECIFIC-RAW-MATERIAL of the driving action) 

All of these examples can (among other things) act as indirect REQUESTS for P2 to do the 
action named. Given the appropriate context, they can also act as indirect SUGGESTS. 
(One such context would be the case where P2 is trying to decide what to do in a situation.) 
The basic pattern that these examples fit is given in the following rule: 

Rule 6.10 

P1 can convey a REQUEST or SUGGEST indirectly by - 

- ASKing whether P2 has <item that fills a semantic case> for/to <action> 34 



Here, the semantic case is one defined for the action named. It is not clear whether this 
formulation is too broad; a pattern involving only a subset of the semantic cases may be 
more correct. For instance, a partner in 6.17 is a filler for the CO-AGENT semantic case. 
Example 6.17 uttered by a tournament official can definitely be a REQUEST that P2 get a 
partner, but can it also be used as a REQUEST that P2 play the next match (as Rule 6.10 
would have it)? 

6.17 Have you got a partner to play the next match? 

While example 6.17 suggests that Rule 6.10 is too broad, other evidence suggests that it 
is too restrictive. In addition to examples that correspond quite well to the semantic input 
cases identified in Section 4, there are other examples that are clearly related but not as easy 
to account for. Consider: 

6.18 Do you have time to take Sarah to the airport? 

6.19 Do you have room to store my plants? 

Examples 6.18 and 6.19 are time- and space-related, respectively. When representing actions 
with OWL-I methods, it is assumed that all actions occur at some time, but, because of the 
generality of this attribute, time is not represented as a semantic input case. (This does not 
preclude the use of a different representation to associate times with particular events.) Even 
if the starting time of an action were represented as a semantic input case, this would not be 
an immediate solution to the problem of accounting for example 6.18. This is because the 
time discussed there is elapsed time, not a single point in time. Similarly, example 6.19 can 
be related to the semantic case SPECIFIC-LOCATION, but it seems to be a question about 
the size of a place, rather than simply a place. While we could consider extending Rule 6.10 
to include attributes of semantic cases, we run the risk of admitting other forms that are not 
ISAs. Pending a solution to the difficulties posed by examples 6.17 to 6.19, I leave Rule 6.10 
as stated. This is done, however, with the caveat that it merits further investigation. 

34. It is possible that this rule can be generalized, since analogous, but not always identical, 
forms exist for other action-centered speech acts. 

To sum up this subsection, we can say that for at least some action-centered speech 
acts, ISA forms are based on semantic input cases of the action named. Work remains to be 
done to establish the exact subclass of action-centered speech acts that permit this type of 
ISA form. 
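
Rule 6.10, together with the open question of whether it should be limited to a subset of semantic cases, can be sketched as below. The function name, the string template, and above all the default set of allowed cases are assumptions made for illustration. The text leaves the correct subset undetermined, and excluding CO-AGENT here simply encodes one possible reading of example 6.17.

```python
from typing import Optional

# Assumption: the subset of semantic input cases allowed to trigger Rule 6.10.
DEFAULT_CASES = frozenset({"OBJECT", "INSTRUMENT", "SPECIFIC-RAW-MATERIAL"})


def rule_6_10(item: str, case: str, action: str,
              allowed_cases: frozenset = DEFAULT_CASES) -> Optional[str]:
    """Build 'Do you have <item> to <action>?' if the case is allowed, else None."""
    if case not in allowed_cases:
        return None
    return f"Do you have {item} to {action}?"


print(rule_6_10("a rope", "OBJECT", "throw to that man"))
print(rule_6_10("a partner", "CO-AGENT", "play the next match"))
```

Passing a larger `allowed_cases` set corresponds to the broader formulation of the rule. Forms such as 6.18 and 6.19 would still fall outside it, since time and room are not fillers of a semantic input case.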

6.4. Tense and Mood Variations 

This completes the presentation of ISA rules associated with the three maxims of 
action. In this subsection I briefly discuss the derivation of future tense ISAs, then go into 
somewhat more detail for subjunctives. 

The following rule expansion can be applied to all ISA rules: 

Expansion for future tense: 

If an action-centered speech act refers to an action that is to take place sometime 
other than "right away," then the ISA form may be stated in the future tense, if 
this is appropriate. 

Although the future expansion and the subjunctive expansion below are rules, note that 
they are not at the same level as the other ISA rules presented so far. The expansions are 
rules that act on other rules. Alternatively, it would be possible (but wordy) to write 
individual ISA rules with separate clauses to account for tense and mood behavior. 

In the future expansion, the "if this is appropriate" hedge is used to account for 
differences such as those between the REQUEST forms 6.20 through 6.23. 

6.20 Will you be able to <action>? 
6.21 Can you <action>? 
6.22 Will you be willing to <action>? 
6.23 ?Will you want to <action>? 

While 6.20 and 6.22 occur in the future tense, there is no future for can, and 6.23 is 
questionable as an indirect REQUEST. Some properties, apparently, are viewed as more 
time-variant than others. It is this difference that the "if appropriate" hedge is meant to allow 
for. 

Turning to the subjunctive, we can frame the following rule expansion: 

Expansion for subjunctive: 

ISA rules that generate ASK forms where the state or action that will be named 
by the finite verb is not set in the past may also be used to generate subjunctive 
forms if the conveyed speech act has a principal intended effect that involves 
active response on the part of P2. 

By "active" I mean a response that is viewed as an event in the environment shared by P1 
and P2. This constraint is used to include utterances such as the indirect ASK 6.24 while 
excluding those such as the indirect representative 6.25. 

6.24 Would you know if I can get a boarding pass at that desk? 

6.25 *Would you know that the plane left already? 

I am assuming that both of these utterances are related to "Do you know..." forms. Here, the 
principal intended effect of 6.24 is the "active" response of supplying an answer, while the 
principal intended effect of 6.25 is the more passive response of understanding the 
propositional content. 

The subjunctive expansion above is written to exclude past actions. This is relevant 
for forms generated from Rule 6.1. For example, although form 6.26 is a common type of 
indirect REQUEST, 6.27 is not an indirect REQUEST at all. 

6.26 Have you dusted? 

6.27 Would you have dusted? 

The exclusion of past actions in the subjunctive expansion effectively bars the interpretation 
of 6.27 as a REQUEST to dust. 

An important question that is relevant here is what event or condition is predicated 
by speech act-related subjunctives. This question has received some attention in the 
literature. One common answer to this question is that the implicit event is "if P1 
performed the speech act", e.g., "if I asked you" for REQUEST. Searle points out in [30], 
however, that while 6.28 may convey a request, 6.29 may not. 

6.28 Would you pass me the salt? 

6.29 Would you pass me the salt if I asked you to? 

Searle suggests instead that the subjunctive is predicated on "if you please" or some variant. 
The notion of active response used in the subjunctive expansion above complements this 
interpretation. "If you please" is a way for P1 to ask P2 to look favorably on an action, and 
that action is the active response associated with the successful completion of the speech act. 
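
The observation that the two expansions are rules acting on other rules can be made concrete with a small sketch. Everything named here is hypothetical: the `AskForm` record, its fields, and the two boolean arguments are assumptions, and the "if this is appropriate" hedge is modeled crudely by leaving the future form unset where no natural future exists.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class AskForm:
    """Hypothetical record of an ASK form produced by some ISA rule."""
    present: str
    future: Optional[str] = None       # None where a future form is not appropriate
    subjunctive: Optional[str] = None
    set_in_past: bool = False          # True if the finite verb names a past state or action


def expand(form: AskForm, action_is_immediate: bool, active_response: bool) -> List[str]:
    """Apply the future and subjunctive expansions to one rule-generated form."""
    variants = [form.present]
    # Future expansion: only for actions that are not to happen "right away".
    if not action_is_immediate and form.future is not None:
        variants.append(form.future)
    # Subjunctive expansion: active response required, past forms excluded.
    if active_response and not form.set_in_past and form.subjunctive is not None:
        variants.append(form.subjunctive)
    return variants


able = AskForm("Are you able to drive Sarah to school?",
               future="Will you be able to drive Sarah to school?",
               subjunctive="Would you be able to drive Sarah to school?")
dusted = AskForm("Have you dusted?",
                 subjunctive="Would you have dusted?", set_in_past=True)

print(expand(able, action_is_immediate=False, active_response=True))
print(expand(dusted, action_is_immediate=False, active_response=True))
```

Under these assumptions the first call yields forms corresponding to examples 6.7 to 6.9, while the second yields only 6.26, reflecting the exclusion of 6.27.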

6.5. Examples that Fall Outside the Rules 

The final topic for this section is a group of ISAs that are not completely accounted 
for by the approach that has been taken. Consider, for example: 

6.30 Is your leg healed enough for you to go to the store for me? 

6.31 Will you be home in time to walk the dog? 

Looking back at the REQUEST preconditions, these examples seem to correspond to (II), 
related to P2's capability, but the correspondence cannot be accounted for by the rather 
restricted matching relationship I am assuming (see Appendix B). 

In trying to account for examples such as these, my first inclination was to appeal to 
the definition of can in Precondition (II) and expand the matching rules to include matches 
on parts of definitions. That is to say, the semantic representation for can would be 
assumed to have a definition that included notions of physical ability, having free time, and 
being within an appropriate spatial range to do an action. It turns out, however, that this 
treatment puts a great deal of strain on the precondition-based approach without really 
solving the problem exemplified by 6.30 and 6.31. In terms of matching, there is still a good 
distance between a healed leg and the more general notion of physical ability, or between 
the relationship of being at home with a dog and the idea of being in the right range to 
perform the action of taking it for a walk. There are definite correspondences here, of 
course, but the point is that establishing the correspondences can be a complex 
computational problem. This is in contrast to the approach I have been taking, where the 
matching process has been carefully constrained. 

Although the approach proposed here appears to be too restricted to handle some 
cases, the answer does not necessarily lie in a wholesale replacement of Rules 6.1 to 6.10 and 
Appendix B with a more general formulation. Instead, I advocate a layered approach, 
using several sets of rules of varying power. We will have made some progress if we can 
isolate the classes of ISAs that can be derived from properties of speech acts using highly 
restricted rules. These forms constitute a significant class of ISAs, and their restricted 
nature can be exploited in a computational model. Other cases can then be handled by a 
more general, and presumably more expensive, inferential mechanism. An example of a 
general inferential approach is found in Allen [1]. 
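
The layered approach can be sketched as an ordered list of recognizers, with the cheap restricted layer consulted first. Both layers below are toy stand-ins and not implementations of anything proposed in this paper: the opener list, the `wanted_actions` tuple (standing for assumed knowledge of P1's goals), and all function names are hypothetical.

```python
from typing import Callable, List, Optional, Tuple

Layer = Tuple[str, Callable[[str], Optional[str]]]


def restricted_rules(utterance: str) -> Optional[str]:
    """Toy stand-in for the restricted rules: match a few fixed REQUEST openers."""
    openers = ("would you ", "can you ", "could you ")
    return "REQUEST" if utterance.lower().startswith(openers) else None


def general_inference(utterance: str) -> Optional[str]:
    """Toy stand-in for a more general, more expensive inferential mechanism."""
    wanted_actions = ("go to the store", "walk the dog")  # assumed goals of P1
    return "REQUEST" if any(a in utterance.lower() for a in wanted_actions) else None


def recognize(utterance: str, layers: List[Layer]) -> Optional[Tuple[str, str]]:
    """Try each layer in order; return (layer name, speech act) for the first hit."""
    for name, layer in layers:
        act = layer(utterance)
        if act is not None:
            return name, act
    return None


LAYERS = [("restricted", restricted_rules), ("general", general_inference)]

print(recognize("Could you shut the door?", LAYERS))
print(recognize("Is your leg healed enough for you to go to the store for me?", LAYERS))
```

The design point is only the ordering: an utterance reaches the expensive layer when no restricted rule applies, as with examples 6.30 and 6.31.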

7. A Taxonomy of Indirect Speech Act Forms 

7.1. Overview 

In this section I present a taxonomy of speech act forms, with emphasis on indirect 
speech acts. The sets of categories in the taxonomy are to be viewed as different 
dimensions, so that utterances can, and typically do, belong to more than one category at 
once. I will use REQUEST as a source of examples. The categorization scheme does apply 
to other speech acts, often, however, with some simplification necessary. 

The first distinction that will be discussed is whether the form is direct, indirect, or 
tag. Indirect forms, of course, are of primary interest for this paper. It is also, however, 
necessary to consider direct speech act forms for the same reason that it was necessary to 
clarify speech act organization in Section 2: one speech act's direct form is another speech 
act's indirect form. Besides direct and indirect forms, I distinguish tag forms. Tag forms 
are sometimes considered indirect forms, but I will argue for their treatment as a separate 
class. 

The next distinction that will be considered is whether an utterance is single-force, 
double-force, or triple-force. Corresponding to this is the notion of whether a given speech 
act is conveyed as the immediate, secondary, or tertiary force of the utterance. This second 
distinction corresponds roughly to Sadock's meaning/entailment distinction, but it is recast 
to avoid the problems described in Section 5. Related to the distinction among forces is the 
question of the existence of speech act idioms. In Section 7.4 I take a position on this issue 
with the introduction of frozen ISA forms. 

The final two distinctions, which apply only to what I have called action-centered 
speech acts, have received only casual attention in the literature. The first distinction 
hinges on whether P1 names the action involved explicitly or refers to it only implicitly. 
The two ISA categories are, accordingly, explicit-action and implicit-action. For implicit- 
action ISA forms, there is the further categorization into three types, which for simplicity 
will be referred to with numbers. The difference between type 1, 2, and 3 implicit-action 
forms lies in the type of information needed to recognize the action intended by P1. 

We can now proceed to a more detailed discussion of each set of categories. 

7.2. Direct, Indirect, and Tag Forms 

Figure 7.1 illustrates the first distinction for REQUEST forms. In this figure, slashes 
are used to indicate choices, parentheses indicate optional words or phrases, and angle 
brackets contain descriptions of words or phrases. 

Direct forms come in two varieties: performative and simple. The former is Austin's 
class of utterances in which the speech act is stated explicitly. Simple forms are special 
grammatical or idiomatic structures that are associated with the speech act. The simple 
form for a REQUEST is the imperative. Note that the relationship between simple forms 
and speech acts is not one-to-one. The simple form for giving instructions is, like that for 
REQUESTS, an imperative, although possibly modified by an infinitive phrase (e.g., "To 
open, push in and twist"). Warnings also use the imperative as a simple form. Not only do 
some speech acts share simple forms, but other speech acts have more than one, e.g., 
dialogue openers with the simple forms "Hello," "Hi," etc. Finally, many speech acts, 
particularly ceremonial ones, lack simple forms altogether and rely on the performative for a 
direct expression of the speech act. 

DIRECT FORMS 

PERFORMATIVE 

I request that <action>. 

ex. I request that you write a program to manipulate blocks for me. 

SIMPLE FORM 

<imperative of action requested> 

ex. Write a program to manipulate blocks for me. 

INDIRECT FORMS 

(1) Will <action>? 

ex. Will you write a program to manipulate blocks for me? 

(2) Would <action>? 

ex. Would you write a program to manipulate blocks for me? 

(3) Can <action>? 

ex. Can you write a program to manipulate blocks for me? 

(4) Could <action>? 

ex. Could you write a program to manipulate blocks for me? 

(5) I want <action infinitive>. 

ex. I want you to write a program to manipulate blocks for me. 

(6) I would like <action infinitive>. 
I'd like <action infinitive>. 

ex. I would like you to write a program to manipulate blocks for me. 

TAG FORMS 

(1) <imperative of action requested>, will you? 

ex. Write me a program to manipulate blocks, will you? 

(2) <imperative of action requested>, would you? 

ex. Write me a program to manipulate blocks, would you? 

(3) <imperative of action requested>, can you? 

ex. Write me a program to manipulate blocks, can you? 

(4) <imperative of action requested>, could you? 

ex. Write me a program to manipulate blocks, could you? 

Figure 7.1. Some examples of explicit-action forms for REQUEST 

The next class is indirect forms, in which the speech act is conveyed by a simple form
for another speech act. Figure 7.1 lists a small subset of the possible indirect forms. The
other sets of categories presented in this section will give us a way to look further at the
nature of indirect forms. Note that Figure 7.1 does not include negations of these indirect
forms, e.g., those involving can't, won't, couldn't, etc. Recall the decision discussed in
Section 6.3.1 to treat these forms as ISAs for the related speech acts for begging and
persuading (among other possibilities).

Finally, tag forms are constructed from a simple form combined with the identifying
part of an indirect form. The identifying part for most indirect forms is the finite verb plus
subject (e.g., "Move it over, would you?"), but in some cases it encompasses more (e.g.,
"What's the answer, I'd like to know"). The ISA forms that may have corresponding tag
forms can be characterized more specifically as frozen forms, a class which is defined in the
next subsection. For REQUESTs, the tag forms correspond to the interrogative frozen ISA
forms. This is not, however, necessarily the case for other speech acts. Since I can give no
more precise specification of the ISA forms that have corresponding tag forms, this is left as
a problem for further research.

A separate category has been allotted for tags, since they may function as an amalgam
of two forms. (For example, in many contexts "Move over, would you?" has the brusqueness
of the simple form which is then softened somewhat by the last-minute introduction of the
more humble "would you.") A view which treats tag forms merely as transformations of
indirect forms misses the dual nature of the message that they can convey.

7.3. The Speech Acts Conveyed

The next set of categories gives a way to talk about the speech act or acts conveyed
by an utterance. My first assumption is that some, but not all, utterances convey more than
one speech act. These different speech acts will be referred to as the forces of an utterance.
For the utterances that convey multiple speech acts, the largest number of such speech acts
that I have found in English is three. Utterances are therefore referred to as either
single-force, double-force, or triple-force, and the three forces will be called immediate,
secondary, and tertiary. 35 The speech acts are called only forces, since I wish to avoid
appealing to any notion of "meaning." Although if pressed I would have to say that I see all
speech act forces conveyed by the utterance as part of the meaning of the utterance, semantic
investigations are at too early a stage, I think, for a concrete definition of meaning. 36

Examples of utterances that convey different numbers of forces are easy to find. First,
utterances with only an immediate force include direct, simple forms, e.g., 5.18 (repeated
below) uttered with an immediate REQUEST force. An utterance with both immediate and
secondary force might be 5.4 uttered with an immediate ASK force corresponding to the
surface form and a secondary REQUEST force. Finally, there is example 5.20, which may
be uttered with the intent to convey an immediate ASK, a secondary representative, and a
tertiary REQUEST force.

5.18 Close the door.

5.4 Are you able to close the door?

5.20 Must I tell you that it's cold in here?



35. My use of the term secondary differs from its use in Searle [30]. I hope that this will not
be a source of confusion.

36. See, however, Zwicky and Sadock [36] and Sadock [25] for important work in this area.

To account for the different forces in these examples, let us hypothesize a mechanism
that applies the rules given in Section 6. (I postpone to Section 7.5 the issue of how to link
the statement of the condition in 5.20 with the particular action intended.) With this
hypothesized mechanism, immediate forces of ISAs are derivable directly from the surface
form (or a semantic representation of it), secondary forces are derived by one application of
the rules, and tertiary forces are derived by two applications of the rules.

There are two basic difficulties with the mechanism proposed, both of which are
familiar from Section 5. First, recall Sadock's observation that 5.1 (repeated below) is not an
indirect REQUEST.

5.1 Tell me if you can move over.

We need some way to block the assignment of three forces -- REQUEST of a tell, ASK, and
REQUEST of a move -- to 5.1. At the same time, it is necessary to preserve the possible
triple-force reading of 5.22. The second difficulty with the proposed mechanism is
illustrated by the examples repeated here:

5.5 Can you please close the door?

5.6 *Are you able to please close the door?

With the mechanism proposed, both 5.5 and 5.6 would be produced in the same way, and
there is no way to account for their differing behavior with respect to please.

To take care of the first problem with the proposed mechanism, we must first amend 
the rules given in Section 6. Each rule must specify that the speech act form used must be a 
simple one, that is, interrogative for ASK, declarative for the different representatives, and 
imperative for REQUEST. For example, Rule 6.3 is now:

Rule 6.3 (for P1-based preconditions):

P1 can convey a speech act indirectly by

-- a simple form of a representative of the topic of a P1-based precondition of the
speech act.

In addition to this change, the proposed mechanism must be limited to a single application
of the rules for all the ISA forms considered here except for one subclass. The only case
where double indirection, and hence the production of tertiary force, is uniformly possible is
a class that will be called type 3 implicit-action forms. 37 This class of ISA forms will be
discussed in Section 7.5.3, and in 7.5.4 consideration is given to the question of why such
double indirection should be possible for this class.
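
The control structure of the restricted mechanism can be made concrete with a short Python sketch. It is an illustration under stated assumptions, not the rules of Section 6: the `Force` type, the three toy rules, the string prefixes used as a stand-in for a semantic representation, and the `BASES` table are all hypothetical. What the sketch shows is only the limit on rule application: the immediate force is read off the surface form, one application yields a secondary force, and a second application is permitted only through a type 3 rule.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional


@dataclass(frozen=True)
class Force:
    act: str      # e.g. "ASK", "STATE", "REQUEST"
    content: str  # toy stand-in for a semantic representation


Rule = Callable[[Force], Optional[Force]]

# Hypothetical table linking a reported state to a remedying action.
BASES = {"it's cold in here": "close the door"}


def ability_rule(f: Force) -> Optional[Force]:
    """ASK about ability to act -> REQUEST for the action (toy rule)."""
    prefix = "able to: "
    if f.act == "ASK" and f.content.startswith(prefix):
        return Force("REQUEST", f.content[len(prefix):])
    return None


def must_tell_rule(f: Force) -> Optional[Force]:
    """ASK 'must I tell you that S' -> representative STATE of S (toy rule)."""
    prefix = "must tell: "
    if f.act == "ASK" and f.content.startswith(prefix):
        return Force("STATE", f.content[len(prefix):])
    return None


def type3_rule(f: Force) -> Optional[Force]:
    """STATE of a basis for action -> REQUEST for the remedy (toy rule)."""
    if f.act == "STATE" and f.content in BASES:
        return Force("REQUEST", BASES[f.content])
    return None


def apply_once(f: Force, rules: List[Rule]) -> Optional[Force]:
    """Return the result of the first rule that applies, if any."""
    for rule in rules:
        derived = rule(f)
        if derived is not None:
            return derived
    return None


def derive_forces(immediate: Force) -> List[Force]:
    """Return the forces in order: immediate, then secondary, then tertiary."""
    forces = [immediate]
    secondary = apply_once(immediate, [ability_rule, must_tell_rule, type3_rule])
    if secondary is None:
        return forces
    forces.append(secondary)
    # A second application is allowed only for type 3 implicit-action forms.
    tertiary = apply_once(secondary, [type3_rule])
    if tertiary is not None:
        forces.append(tertiary)
    return forces
```

With these toy rules, `Force("REQUEST", "close the door")` (5.18) yields a single force, `Force("ASK", "able to: close the door")` (5.4) yields two, and `Force("ASK", "must tell: it's cold in here")` (5.20) yields three.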

This brings us to the second difficulty with the proposed mechanism. The special
behavior illustrated by 5.5 and 5.6 is one example of the type of phenomenon that the idea
of speech act idioms attempts to account for. Although a relatively small group of ISA
forms display such special behavior, they have received a great deal of attention in the
linguistics literature. Because of a rather interesting combination of characteristics, these
forms have been resistant to attempts to fit them into various theoretical frameworks. After
several different iterations I find that I come close to subscribing to the concept of speech
act idioms. I feel, however, that Searle has made a convincing argument against the use of
the term idiom. The next subsection, therefore, describes a slightly different class called
frozen ISA forms.

37. As ever, there are some borderline cases. Some indirections on secondary forms do seem
to carry three forces, e.g., "I'd like to know if you could drive us to the airport" (an
immediate STATE, a secondary ASK, and a tertiary REQUEST). The examples I have
found tend to be isolated, however, and they will be treated here as frozen forms (see the
next subsection). The only uniformly productive class among those considered in this paper
are the type 3 implicit-action forms.

7.4. Frozen ISA Forms 

I start the explication of frozen ISA forms with the intuition that the sentence in 7.1
can be uttered as a REQUEST -- not an ASK and a REQUEST, but simply a REQUEST.

7.1 Will you close the door?

The first time that I consciously thought about example 7.1, it took a fair amount of effort
for me to get to the literal speech act force (a question about a future action). Not all ISAs
are like this, of course. An utterance such as 5.19, repeated below, seems to be primarily a
representative, and only secondarily a REQUEST.

5.19 It's cold in here. 

In searching for the source of this intuition, I come to the fact that the ISA forms that
I want to call "frozen" can or do display special behavior uncharacteristic of the rest of the
surface form. I therefore define frozen forms as follows: 38

A frozen ISA form is an ISA form with an immediate force that does not correspond to
the surface form. For such an utterance, one or both of the following obtains:

1. It can or does display syntactic behavior atypical of the rest of the form.
2. It can or does display cooccurrence behavior atypical of the rest of the form.

38. It would be nice if special intonational behavior were associated with indirect
interpretations of utterances, as well. The evidence, however, points to special
disambiguating intonations associated with some direct interpretations of utterances. (See
Sag and Liberman [26].) Although this study reports a strong association between some
intonation contours and indirect interpretations, the intonations did not decisively
disambiguate indirect forms.

This definition of frozen forms is almost a direct translation of Sadock's criteria for speech
act idioms into the framework that has been developed here. 39 (An addition to this
formulation, however, is proposed below.) Clause 2 above is illustrated by the special
cooccurrence properties of example 5.5 as contrasted with example 5.6 (repeated here).

5.5 Can you please close the door?

5.6 *Are you able to please close the door?

An example pointed out by Sadock that fits Clause 1 is 7.2, which is a passive form of 7.1
above.

7.2 Will the door be (please) closed by you?

The REQUEST force in 7.1 is, intuitively, more direct than that in 7.2. Moreover, the
special cooccurrence properties of 7.1 are lost.

The presence of please in 5.5 is not an isolated phenomenon, as Sadock has pointed
out. Compare:

7.3 Please, will you close the door?

7.4 Will you please close the door?

7.5 Will you close the door, please?

7.6 Please, it's cold in here.

7.7 *It's cold in here please.

39. Note that Sadock's "paraphrase" criterion ([24], chap. 5) has been omitted from the
frozen form definition. While the notion of a paraphrase is useful intuitively, I feel that we
lack a refined understanding of what does, and does not, constitute a paraphrase of an
utterance. The notion of a paraphrase is therefore not yet a computationally useful one.

Even where initial please is possible for secondary REQUESTs (e.g., 7.6), Sadock points out
that its sense is slightly different. This use of please is more likely to appear in a
REQUEST that someone stop doing what he or she is either about to do or currently
doing. In addition, for many immediate REQUEST forms, please may occur initially
without a pause, e.g.,

7.8 Please will you close the door?

7.9 *Please it's cold in here.

The non-imperative forms that convey a REQUEST and take please freely include
the indirect forms listed in Figure 7.1. Note, however, that not all cases are as clear-cut as
the examples given. Consider:

7.10 *Would you mind please picking up George at the airport?

7.11 ?Would you mind picking up George at the airport please?

There are a number of these cases for which terminal please seems to be at least marginally
acceptable. This sort of gradual decay of a criterion, as opposed to a sharp cutoff, is a
familiar phenomenon in linguistics. The way that frozen forms have been defined,
however, it is not necessary that the frozen forms for a speech act all display a single type of
special behavior or that, in this case, please occur freely. It is enough that the form may
display some syntactic or cooccurrence behavior atypical of the rest of the form.

Given the definition of frozen ISA forms, we can amend the simple mechanism 
hypothesized above. For each speech act, it appears that the set of frozen forms must be 
represented by its extension. (This applies to the forms, of course, not to the set of the 
individual utterances.) The set of patterns for frozen forms can be used to derive
immediate force, given the surface form of the utterance. For frozen forms, this process
occurs instead of the normal application of the general ISA rules.

This is an improvement, but the proposed mechanism is still not complete. Since
frozen forms have an immediate force that does not reflect the surface form, I must account
in some way for the response behavior discussed in Section 5.2. It will not do in general to
say that this response behavior results from processing frozen forms as if they were not
frozen, e.g., as if the REQUEST force for 7.1 were secondary. If this were done, we would
be at a loss to account for the special surface behavior such as the unexpected presence of
please in a non-imperative frozen REQUEST form.

The best answer, I think, lies in assuming that each frozen form pattern has
associated with it a pointer to the relevant general ISA rule, particularized to a speech act
appropriate to the surface form. The pointer provides a potential interpretation of the
utterance, that may or may not be taken. 40 Given this pointer, no information is lost, and
responses may be keyed off either the immediate force or the general ISA rule. Such a
pointer is needed only for frozen forms. In other cases, immediate force corresponds to
surface form, and keying responses off immediate force is equivalent to responding to
surface form.
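
One way to picture a frozen form pattern with its pointer is as a small table entry. The following Python sketch is a hedged illustration only: the regular expressions cover just the will/would/can/could forms of Figure 7.1, and the rule labels (`ask-about-future-action`, `ask-about-ability`) are hypothetical names, not rule numbers from Section 6. Each pattern assigns the immediate REQUEST force directly and carries along the literal speech act and the rule pointer, so that a response can be keyed off either one.

```python
import re
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class FrozenPattern:
    regex: str         # surface pattern; the set of patterns is listed by extension
    immediate: str     # immediate force assigned directly by the pattern
    literal_act: str   # speech act appropriate to the surface form
    rule_pointer: str  # hypothetical label for the general ISA rule


@dataclass(frozen=True)
class Interpretation:
    immediate: str     # force taken by default
    action: str        # action named in the utterance
    literal_act: str   # potential (literal) force, which may or may not be taken
    rule_pointer: str  # where to look if the literal reading is pursued


# Shared tail: optional medial "please", the action, optional final "please".
_TAIL = r" you (?:please )?(?P<action>.+?)(?:,? please)?\?$"

FROZEN_REQUESTS: List[FrozenPattern] = [
    FrozenPattern(r"^(?:please,? )?(?:will|would)" + _TAIL,
                  "REQUEST", "ASK", "ask-about-future-action"),
    FrozenPattern(r"^(?:please,? )?(?:can|could)" + _TAIL,
                  "REQUEST", "ASK", "ask-about-ability"),
]


def match_frozen(utterance: str) -> Optional[Interpretation]:
    """Return the frozen-form reading of an utterance, or None if none matches."""
    text = utterance.strip().lower()
    for pattern in FROZEN_REQUESTS:
        m = re.match(pattern.regex, text)
        if m:
            return Interpretation(pattern.immediate, m.group("action"),
                                  pattern.literal_act, pattern.rule_pointer)
    return None
```

In this sketch "Will you close the door?" and "Can you please close the door?" both receive an immediate REQUEST force with the action "close the door", while "Are you able to close the door?" matches no frozen pattern and would be left to the general rules.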

As a final remark on frozen forms, note that they fall somewhere between Sadock's
speech act idioms and Searle's idiomatic but non-idiom forms. It is the addition of the
pointer that distinguishes frozen ISA patterns from patterns of idioms. For these "true"
idioms, the literal interpretation of the form is totally lost in normal usage, e.g., "How do you
do?" used in an introduction exchange. For frozen forms, on the other hand, the pointer
makes a literal interpretation accessible, if not always accessed. While frozen forms are not
idioms, they are also not merely idiomatic, because, by virtue of possible "cross-over"
syntactic and/or cooccurrence behavior, they are atypical of the speech act that corresponds
to the surface form.

40. This treatment of the literal speech act force of a frozen form as only a potential force is
comparable to Morgan's notion of short-circuited implicature as it is applied to speech acts
[21]. Short-circuited implicatures are those whose "literal meaning is 'latent' rather than the
basis for an inference." The pointer to an ISA rule proposed here for frozen forms appears
to combine information from the occasion and purpose slots in Morgan's scheme.

7.5. Explicit and Implicit Actions 

We turn now to action-centered speech acts and consider the distinction between
actions named explicitly by P1 and those referred to only implicitly. Explicit forms are the
straightforward case; example 5.18 ("Close the door.") is a typical explicit form.
Implicit-action forms are relatively complex; at least three different varieties of
implicit-action speech act can be identified. In this subsection, I discuss implicit-action
ISAs, taking each of the three types in turn. Due to the complexity of the subject matter,
the account will be fairly detailed.

Before plunging in, a word must be said about the relationship of implicit-action
ISAs to dialogue context. Implicit-action ISAs rely heavily on context. P1 is obliged to
frame an implicit-action ISA in such a way that P2 can uniquely identify the action, given
the context. P1 must take into account not only the context, but also P1's knowledge of P2's
model of the context. Dialogue context plays a progressively greater role for each of the
three implicit-action types that will be described here.

Despite this heavy reliance on context, it appears that the difference between
implicit-action and explicit-action utterances in this respect is basically quantitative rather
than qualitative. Although more information must be supplied by context for implicit-action
ISAs, the same types of knowledge are needed for both classes of utterance. This knowledge
is needed both to choose the correct interpretation for an utterance (e.g., to distinguish
between direct and indirect speech acts) and to round out the specification of an action. See
Section 12 for a discussion of some of the knowledge sources that come into play for all types
of ISAs, including both implicit- and explicit-action forms.

7.5.1. Must you smoke? 

We start with type 1 implicit-action forms, those for which the action is determinable
from a combination of the choice of indirect utterance form and the action named.
REQUESTs do not seem to have any frozen forms of this type, but there is a pattern with a
secondary REQUEST force, 7.12.

7.12 Must you <action>?

Paraphrases of 7.12, e.g., "Do you have to <action>?" are also type 1. The use of 7.12
indicates that a stopping or avoiding action is desired, and in particular the action to be
stopped or avoided is the one named. For example, "Must you talk so loud?" is a request
for either the implicit action "Stop talking so loud" or "Avoid talking so loud."

Example 7.12 and its paraphrases can be accounted for by the following rule:

Rule 7.1: 41

P1 may ASK about P2's obligation to perform an action

-- instead of --
making a REQUEST by ASKing whether P2 can stop or avoid the action.
(Rule 6.5 acting on REQUEST precondition II.)

Rule 7.1, and the other rules that will be given for implicit actions, represents a link between
an ISA pattern (the first half) and either a simple form or a pattern produced by one of the
general rules from Section 6 (the second half). 42

Example 7.12 and its paraphrases are the only type 1 implicit-action REQUESTs that
I have found, but we have the SUGGEST forms "Why not <action>?" and "Why <action>?"
For these forms, the suggestion is "do <action>" and "stop or avoid <action>," respectively.

The important feature of Rule 7.1 and the other rules in this class is that the choices
of intended action (stopping or avoiding the action) are listed explicitly in the rule. Context
plays a role in determining which of the two was intended by P1, but the operation for
determining the implicit action is a choice, not an open-ended search.
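
The point that type 1 recognition is a choice among listed alternatives can be shown in a brief Python sketch. The pattern list and the selection policy are assumptions made for illustration: in particular, the idea that "stop" is chosen when the named action is already under way, and "avoid" otherwise, is a hypothetical stand-in for whatever contextual knowledge actually settles the matter.

```python
import re
from typing import List, Optional, Set, Tuple

# Surface patterns for 7.12 and its paraphrase; the list is illustrative only.
TYPE1_PATTERNS = [
    r"^must you (?P<action>.+)\?$",
    r"^do you have to (?P<action>.+)\?$",
]

Candidate = Tuple[str, str]  # (operation on the action, action named)


def type1_candidates(utterance: str) -> List[Candidate]:
    """List the intended actions named explicitly by the rule: stop or avoid."""
    text = utterance.strip().lower()
    for pattern in TYPE1_PATTERNS:
        m = re.match(pattern, text)
        if m:
            action = m.group("action")
            return [("stop", action), ("avoid", action)]
    return []


def choose(candidates: List[Candidate], ongoing: Set[str]) -> Optional[Candidate]:
    """Pick one candidate from context (hypothetical policy, see lead-in)."""
    for operation, action in candidates:
        if operation == "stop" and action in ongoing:
            return (operation, action)
    for candidate in candidates:
        if candidate[0] == "avoid":
            return candidate
    return None
```

For "Must you talk so loud?" the candidate list has exactly two members, and context only selects between them; nothing in the procedure searches for an action that the rule did not already list.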

7.5.2. Can you reach the salt? 

For the next implicit-action form, type 2, P1 has some action in mind but the implicit
action is not determinable from surface form alone. Type 2 implicit-action forms name
some component of the implicit action, and the implicit action is determinable from
knowledge of the structure of actions plus knowledge about the context of the utterance.
Context is necessary because the action component named may be associated with more than
one action.

41. Earlier versions of Rule 7.1 and the other rules for implicit-action forms explicitly stated
that the action intended by P1 should be clear to P2 from the context. I now believe, however,
that this condition is part of the more general constraint to avoid ambiguity, which applies
to direct as well as indirect speech acts. (See Section [illegible].) It is therefore not necessary to
restate the principle in each implicit-action rule.

42. Note that Rule 7.1 is explicitly tied to REQUESTs. It is not clear that implicit-action
rules need to be tied so closely to individual speech acts. I have found some rules shared by
pairs of action-centered speech acts, but further investigation in this area is needed.

More specifically, for a definition of component I draw on the model of actions
introduced in Section 4. A component is either a state that is a prerequisite of the implicit
action, a PRINCIPAL-RESULT of the action, one of the action's semantic input cases, or a
subaction. 43 In this subsection, we examine rules based on each of the four types of
component.

We start with a type 2 rule based on prerequisites:

Rule 7.2:

P1 may ASK about or utter a representative about a stative prerequisite of an action
where P2 is in some way a participant in the state

-- instead of --
making a REQUEST by ASKing about or uttering a representative about whether
P2 can take responsibility for carrying out the action.
(Rules 6.5 and 6.6 acting on REQUEST precondition II.)



Rule 7.2 can be illustrated by an example of Searle's, the use of 7.13 to convey a REQUEST.

7.13 Can you reach the salt?

Here, the implicit action is that P2 pass the salt (either to P1 or to someone else clearly
identifiable from the situational context). Instead of naming the action explicitly, P1 names
a stative prerequisite of the action. Note that naming a stative prerequisite does not always
imply the action. One may ask "Is the salt near you?" or "Can you reach the salt?" for "Can
you pass the salt?" but 7.14 or 7.15 do not imply 7.16 or 7.17 in response to a request to
describe the dinner:

7.14 The salt was near Harry.

7.15 Harry was able to reach the salt.

7.16 Harry passed the salt.

7.17 Harry was able to pass the salt.

7.18 I asked for the salt and Harry was able to reach it.

Example 7.18 does imply that Harry passed the salt, but only because the speech act context
was reported. Thus, the action implied is a property of the particular speech act context
and not a property of the nature of stative prerequisites alone.

43. The subactions that can be used in type 2 implicit-action forms appear to be restricted to
standard path steps (as defined in Section 4).

Turning from prerequisites, the next component type is the PRINCIPAL-RESULT.
Two rules are of interest here:

Rule 7.3:

P1 may utter a representative about a desire or need 44
for a state or object that is the PRINCIPAL-RESULT of some action, where the
desire or need is P1's or occurs for someone P1 is empowered to speak for,

-- instead of --
making a REQUEST by uttering a representative about a comparable desire or need
for P2 to take responsibility for carrying out the action.
(Rule 6.3 acting on REQUEST precondition I.)

Rule 7.4:

P1 may ASK whether the PRINCIPAL-RESULT of a desired action obtains

-- instead of --
making a REQUEST by ASKing whether the desired action has been done.
(Rule 6.1, Clause (iv).)



44. Desire and need together are equivalent to want as I have been using it. I used
"desire or need" here to make the rule read smoothly, but this phrase can be replaced by
want without changing the sense of the rule.

An utterance related to Rule 7.3 is 7.19, which has an immediate REQUEST force.

7.19 I want this room clean when I come back, please.

This example illustrates a property of those implicit-action forms that involve a
PRINCIPAL-RESULT. In uttering 7.19, P1 may or may not have a particular action in
mind. On the one hand, P1 may want P2 to call the cleaning service, a particular action.
On the other hand, P1 may wish to convey only a desire that P2 find some way to achieve
the general goal "get the room clean." In this second case, P1 leaves the choice of a
particular action to P2. Note that I will still consider this second use of 7.19 to have a single
implicit action (e.g., "get the room clean"), keeping in mind, however, that the action is
specified only in a general way.

Moving on to the next PRINCIPAL-RESULT rule, Rule 7.4 is similar to 7.3,
producing REQUESTs such as 7.20. An explicit-action ISA that corresponds to 7.20 is 7.21,
which is based on Rule 6.1, Clause (iv).

7.20 Do you have the letter?
7.21 Did you pick up the letter?

The component relationship should be clear in this pair of examples: having a physical
object is the PRINCIPAL-RESULT of a pick-up-physical-object method. In turn, the
picking up action done is the principal intended effect of an unuttered REQUEST.

We come now to the third componential relationship, semantic input cases of actions
named. An example utterance is 7.22, spoken as a REQUEST that P2 light P1's cigarette. 45

45. If, on the other hand, 7.22 is uttered as a REQUEST to give P1 a match (or matches),
then I would consider the relationship involved to be prerequisite and the utterance to be
derivable from Rule 7.2.

7.22 Do you have a match?

Here, matches are one possible INSTRUMENT in a light-cigarette method. Thus, we get
the following rule:



Rule 7.5:

P1 may ASK whether P2 has <item that fills an INSTRUMENT slot>

-- instead of --
making a REQUEST by ASKing whether P2 has <item that fills an INSTRUMENT
slot> for/to <action>
(Rule 6.10 acting on the INSTRUMENT case.)

It is not clear to me what other semantic cases may appear in type 2 implicit-action forms.
Certainly, type 2 ISAs based on semantic input cases are similar to the explicit-action forms
discussed in Section 6.3.2. It appears, however, that type 2 implicit-actions are derived from
a more limited set of semantic cases. This is a question that I leave to further research.

This brings us to the fourth, and final, type of component, subactions. An example is
7.23, which in the proper context REQUESTs not only a transfer but also a creation action
(i.e., write the program).

7.23 Give me a program that builds an arch.

Creation actions bring a new object, either physical or mental, into existence, and creation
actions performed for someone else have a transfer step near the end of the standard
path(s). In this transfer step, the AGENT of the action transfers control of the object that is
in the PRINCIPAL-RESULT to the BENEFICIARY. This transfer subaction may be
named instead of the creation action in a type 2 implicit-action REQUEST.

To summarize what has been said about type 2 forms, we can say that these forms are
recognizable with reference to general knowledge about the components of different actions.
Components are the prerequisites, PRINCIPAL-RESULTs, semantic input cases, and
subactions described in Section 4. Type 2 implicit-action rules include some that apply to
actions in general and some that apply to specific classes of actions.
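
The way a named component leads back to an implicit action can be sketched as a lookup over a small action model. Everything in the following Python fragment is an assumption made for illustration: the `Action` record is a flattened stand-in for the model of Section 4, the entries in `ACTIONS` are invented (including the hypothetical "light the candle" entry, added only to create an ambiguity), and treating context as a set of salient action names is a deliberate simplification.

```python
from dataclasses import dataclass
from typing import FrozenSet, List, Optional, Set


@dataclass(frozen=True)
class Action:
    name: str
    prerequisites: FrozenSet[str] = frozenset()
    principal_result: Optional[str] = None
    instruments: FrozenSet[str] = frozenset()  # one kind of semantic input case
    subactions: FrozenSet[str] = frozenset()

    def components(self) -> FrozenSet[str]:
        """All components that a type 2 form might name for this action."""
        result = (frozenset() if self.principal_result is None
                  else frozenset({self.principal_result}))
        return self.prerequisites | result | self.instruments | self.subactions


# Toy action model; every entry is illustrative.
ACTIONS: List[Action] = [
    Action("pass the salt", prerequisites=frozenset({"can reach the salt"})),
    Action("pick up the letter", principal_result="have the letter"),
    Action("light the cigarette", instruments=frozenset({"match"})),
    Action("light the candle", instruments=frozenset({"match"})),
    Action("write the program", subactions=frozenset({"give the program"})),
]


def candidates(component: str) -> List[Action]:
    """Actions of which the named component is a part."""
    return [a for a in ACTIONS if component in a.components()]


def resolve(component: str, context: Set[str]) -> Optional[Action]:
    """Identify the implicit action, using context only to break ties."""
    found = candidates(component)
    if len(found) == 1:
        return found[0]
    salient = [a for a in found if a.name in context]
    return salient[0] if len(salient) == 1 else None
```

Here "can reach the salt" resolves to "pass the salt" on its own, whereas "match" is associated with two actions and resolves only when the context makes one of them salient.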

7.5.3. It's cold in here

Finally, we come to type 3 implicit-action forms. Type 3 forms are those in which a
state or action named constitutes a basis for the implicit action. The notion of a basis for
action will be discussed further below. A standard example of type 3 forms found in the
literature is 5.19, repeated below, used as a REQUEST to close the door.

5.19 It's cold in here.

The rule that characterizes type 3 implicit-action forms for REQUESTs can be stated
informally as follows:

Rule 7.6:

P1 may utter a representative about a basis for action where
P1 wants P2 to take responsibility for carrying out an action that remedies an
undesirable state 46
(This state constitutes the basis for action or has a special relationship to it. More
specifically, the state is "S1" in the definition of a basis for action, below.)

-- instead of --
making a REQUEST by uttering a representative about P1's desire or need for the
action.
(Rule 6.3 acting on REQUEST precondition I.)

46. In uttering a REQUEST derived from Rule 7.6, P1 reflects a desire to remedy an
undesirable state. Thus, the negation of the undesirable state is a PRINCIPAL-RESULT
desired by P1. In the last subsection, I said that implicit-action forms
involving a PRINCIPAL-RESULT could be uttered with or without a particular action
in mind. We see this quite clearly for utterances derived from Rule 7.6, as evidenced by
the following common sort of ambiguity: P1: "It's cold in here." P2: "What do you want
me to do about it?" P1: "I don't care. Just do something."



A basis for action may be defined as follows:

i. a state (S1) that is undesirable
   either:
      because it is in conflict with some goal in the context (from the local context to the
      very general "social context")
   or
      because when S1 is combined with other states in the context it results in a state
      S2, where S2 is in conflict with some goal, as above

ii. a symptom of S1
   Here, a symptom is a state S3 that typically coexists with S1, where when S3 occurs one
   is warranted to check whether S1 obtains.

iii. a signal for S1
   A signal is the communication step of a process that is set up to monitor something
   else -- e.g., another process, a physical location, etc. -- for the existence of some state.



This is quite a bit of material; some examples may help to clarify the definition. First,
example 5.19 above can be thought of as reporting a state that is in direct conflict with the
goal of maintaining a comfortable temperature (part (i) of the definition). This type of
basis for action can be used as an answer to a why-question asked by a non-participant in
the speech act, e.g., 7.24. 47

7.24 P1: Why did you close the door?
     P2: Because it was cold in here.



An example of a symptom used as a basis for action is the following:

7.25 The cake is beginning to smell.

A cake smelling is a symptom of its being done. The fact that the cake is in the oven (a
contextual state) combined with the fact that the cake is done can result in a burned-cake
state, which is generally in conflict with the goal of baking a cake. 48 Two remarks can be
made here. First, while this sort of analysis of states can always be done, at least in many
cases I do not believe that it must be done in order to recognize an ISA. I will expand on
this point below. Second, note that a symptom need not name an unpleasant situation, as
7.25 illustrates.

Finally, we come to signals, an example of which is 7.26.

7.26 The buzzer went off.

The oven buzzer could be a signal for the fact that the cake is done, and, by a chain of
states similar to that for 7.25, this state could lead to a conflict with the goal of having an
edible cake.
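
The three-part definition of a basis for action, together with the examples 5.19, 7.25, and 7.26, can be restated as a small Python sketch. It is offered only to make the definition concrete; as noted above, this sort of state analysis need not be carried out in order to recognize an ISA. The four tables and all of their entries are hypothetical, and states are represented by plain strings for brevity.

```python
from typing import Dict, Optional, Set, Tuple

# Hypothetical knowledge tables, keyed by plain-string state descriptions.
SYMPTOM_OF: Dict[str, str] = {"cake smells": "cake is done"}       # S3 -> S1
SIGNAL_FOR: Dict[str, str] = {"buzzer went off": "cake is done"}   # signal -> S1
COMBINES: Dict[Tuple[str, str], str] = {                           # (S1, context) -> S2
    ("cake is done", "cake is in the oven"): "cake is burned",
}
CONFLICTS: Dict[str, str] = {                                      # state -> goal
    "it is cold in here": "comfortable temperature",
    "cake is burned": "edible cake",
}


def conflicting_goal(s1: str, context: Set[str]) -> Optional[str]:
    """Part (i): S1 conflicts with a goal directly, or via a combined state S2."""
    if s1 in CONFLICTS:
        return CONFLICTS[s1]
    for contextual_state in context:
        s2 = COMBINES.get((s1, contextual_state))
        if s2 in CONFLICTS:
            return CONFLICTS[s2]
    return None


def basis_for_action(named: str, context: Set[str]) -> Optional[Tuple[str, str, str]]:
    """Classify the named state as (kind, S1, goal in conflict), or None."""
    readings = (
        ("state", named),                    # part (i): the state itself is S1
        ("symptom", SYMPTOM_OF.get(named)),  # part (ii): a symptom of S1
        ("signal", SIGNAL_FOR.get(named)),   # part (iii): a signal for S1
    )
    for kind, s1 in readings:
        if s1 is None:
            continue
        goal = conflicting_goal(s1, context)
        if goal is not None:
            return (kind, s1, goal)
    return None
```

Under these assumptions "it is cold in here" is a basis for action by direct conflict, while "cake smells" and "buzzer went off" qualify only when the context contains "cake is in the oven", mirroring the chain of states described for 7.25.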



47. I specify non-participant here because I suspect that, if the question in 7.24 is asked by
the participant making the REQUEST, then the preferred answer is something like
"Because you told me to."

48. This combination of states exemplifies the second half of part (i) of the basis for action
definition.

There is still more to be said about the notion of a basis for action. In particular,
there is the question of how a basis for action can be related to the general structure of
actions developed in Section 4. Since the answer to this question involves quite a bit of
additional detail, it appears in Appendix D. This permits us to move directly to a
discussion of the special properties of type 3 forms.

7.5.4. Special Properties of Type 3 Implicit Action Forms 

In Section 7.3 I observed that the type 3 imphcit-action class constitutes an exception 
to the single indirection limit. In addition to this, there is another property displayed by 
type 3 form*. Compare: 

7.27 Get me a cup of coffee, and I'd like something to eat, please. 

7.28 Get me a cup of coffee, and do you have any pie? 

7.29 «Get me a cup of coffee, and I'm hungry 

Example 7.27 shows a frozen REQUEST form conjoined with a simple form, and example
7.28 contains a type 2 implicit-action form accounted for by Rule 7.2. (Not all type 2
implicit-action forms are as acceptable as 7.28, but all that I have looked at are at least
marginally acceptable when conjoined in this way.) Finally, 7.29 shows a type 3 implicit-action
REQUEST that cannot be successfully conjoined with the simple form. It appears that type
3 implicit-action REQUESTS are the only sort of REQUEST forms that may not be
conjoined with simple forms.

The uniform possibility of double indirection and the special conjunction behavior
support the treatment of type 3 forms as a separate class. We still, however, need an
explanation of why such behavior should be exhibited by type 3 forms at all. While I
cannot answer this question in detail, I suggest an account based on the degree of
independence of the speech act forces of an utterance.

For type 3 forms, the message conveyed by the representative speech act force is
relatively independent of the action-centered speech act force. The immediate force of a
type 2 implicit-action form, for example, names a component of the implicit action, which is a
close relationship. In contrast, the "basis for action" relationship in type 3 forms is one in
which the state or action named is more separate and on a more equal footing with the
implicit action (see Appendix D).

To the extent that a speech act force is independent it can be seen as carrying its own 
separate message. The message conveyed by a representative force of a type 3 ISA is
apparently separate enough that it merits the possibility of the additional modulation 
afforded by indirection. Similarly, the representative message of a type 3 form is 
independent enough of the other force(s) that, when conjoined with a type 3 action-centered 
form, the representative force competes with the action-centered force for a place in the 
semantic interpretation of the conjunction. (The difficulty here is that simple representatives 
do not normally conjoin with simple action-centered speech acts.) 

Type 3 forms, then, exhibit special surface behavior. I suggest an explanation of this
behavior based on semantic properties used to define ISA classes. The explanation appeals
to a notion of the degree of independence between the speech act forces of an utterance.

7.6. Summary: Implications for the General ISA Rules

The categories given divide utterance forms into direct, indirect, and tag. Utterances
are said to be single-, double-, or triple-force, and a given speech act may appear as the
immediate, secondary, or tertiary force of the utterance. Frozen ISA forms are those forms
whose immediate force differs from the speech act in a literal interpretation of the surface
form of the utterance. REQUEST forms (and those of other action-centered speech acts)
specify actions either explicitly or implicitly, and implicit-action forms are further classified
into three types. Implicit-action forms are distinguished according to whether the intended
action is derivable using surface form (type 1), whether it is named by a component of a
method associated with the action (type 2), or whether a "basis for action" is appealed to
(type 3).

The impact of this taxonomy on the general ISA rules (Rules 6.1 to 6.10 from Section
6) can be summarized as follows:

Additions 

1. The general ISA rules are augmented by implicit-action rules as described in Section 7.5. 

2. The general ISA rules are augmented by a set of frozen form patterns as described in 
Section 7.4. 

Limitations 

1. The general ISA rules are amended to permit only simple speech act forms as discussed in 
Section 7.3. 

2. The general ISA rules are applied once only, except as noted below.

Production of Forces



Immediate Force 



Frozen ISA Forms: produced by a combination of implicit-action rules as necessary 
and the patterns for frozen forms 

Others: directly derivable from a combination of implicit-action rules as necessary and 
the surface form of the utterance 



Secondary Force 

All ISAs other than frozen forms have a secondary force derivable by a combination
of implicit-action rules as necessary plus one application of the general ISA rules.



Tertiary Force 

Of the forms examined here, only type 3 implicit-action forms (uniformly) may carry a
tertiary force. This force is derivable by a combination of the type 3 implicit-action
rules plus two applications of the general ISA rules.
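
The production summary above can be read as a small lookup from form category to the forces a form may carry. The Python sketch below is illustrative only. The names FormKind, ForceDerivation, and derivable_forces are hypothetical, and the phrase "implicit-action rules as necessary" is deliberately left out of the model, so the sketch records only where the immediate force is read from and how many applications of the general ISA rules each further force requires.

```python
from dataclasses import dataclass
from enum import Enum, auto


class FormKind(Enum):
    """Coarse form categories; the names are hypothetical labels."""
    DIRECT = auto()          # simple form with a single force
    FROZEN_ISA = auto()      # frozen ISA form (Section 7.4)
    OTHER_ISA = auto()       # any other ISA except type 3 implicit-action
    TYPE3_IMPLICIT = auto()  # type 3 implicit-action form (Section 7.5)


@dataclass(frozen=True)
class ForceDerivation:
    force: str                      # "immediate", "secondary", or "tertiary"
    read_from: str                  # what the immediate force is taken from
    general_rule_applications: int  # applications of the general ISA rules


def derivable_forces(kind):
    """List the forces a form of this kind may carry, following the summary."""
    source = "frozen form patterns" if kind is FormKind.FROZEN_ISA else "surface form"
    forces = [ForceDerivation("immediate", source, 0)]
    # All ISAs other than frozen forms have a secondary force (one application).
    if kind in (FormKind.OTHER_ISA, FormKind.TYPE3_IMPLICIT):
        forces.append(ForceDerivation("secondary", source, 1))
    # Only type 3 implicit-action forms may carry a tertiary force (two applications).
    if kind is FormKind.TYPE3_IMPLICIT:
        forces.append(ForceDerivation("tertiary", source, 2))
    return forces
```

Treating the number of rule applications as data keeps the single indirection limit and its type 3 exception visible in one place.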
PART 2
COMPUTATIONAL IMPLICATIONS


8. Computational Perspectives on ISAs 

8.1. Drawing Computational Implications from the ISA Categorization 

The first part of this paper developed a characterization of indirect speech acts and
justified it with traditional (non-computational) sorts of linguistic arguments. Now, in Part
2, some of the computational implications of this ISA categorization will be considered. The
main point of Part 2 is that the distinctions made in Part 1 between ISA types occur at
precisely the places where computational processes should be distinguished.

The computational implications that I wish to explore will be presented in the context
of a process model for ISA recognition.49 That is to say, I will be concerned with relating
utterances to an appropriate underlying representation. The following two questions will,
together, provide a focus for Part 2:

1. Given an utterance form, how can one represent, in a computationally useful way,
the possible speech acts it can realize?

2. How can the system for characterizing ISAs from Part 1 be related to a view of
recognition as essentially a path-building process? The paths in question are links
between the speech acts conveyed by an utterance and nodes on a tree representing
the events of the dialogue. (This view will be developed in Section 9.)

49. I use the word recognition rather than understanding to emphasize the importance that
expectations will play in the process model; see Section 11. Note that only recognition will be
considered here. I see generation as at once an easier and a more difficult problem. On the
one hand, the speaker is spared some of the problems of ambiguity and incomplete
knowledge about what is said, making the generation process simpler. On the other hand,
the generation process is complicated by a need to take the recognition process of the hearer
into account, to avoid ambiguity when this is necessary or desirable.

For the answer to the first question, I draw on the distinctions from Part 1.
Depending on category, the model will use explicit lists of realization patterns or rules at
various levels of generality.

In answering Question 2, I will be concerned with the type and number of paths to be
constructed for each utterance form. The immediate/secondary/tertiary distinction from
Section 7.3 is seen as directly affecting this question, with one path built for each force.
Another factor here is the direction in which paths are constructed; choice of direction has
an impact on the representation of ISA forms.

Although these questions were stated separately, they are actually strongly interrelated.
Accordingly, after a general discussion of recognition in Section 9, I discuss the questions
together in Sections 10 and 11 and go on to look at some aspects of the role of context in
Section 12. Note that the answers to the two questions are not intended to be complete. In
no sense am I attempting to present a full computational model of ISA recognition. I have
concentrated on what I see as the more important computational implications of the theory
presented in Part 1, with the hope that the ISA recognition scheme outlined in Part 2 will
eventually form the basis of a more complete computational model.

Every effort has been made to make the discussion of ISA recognition
implementation-independent. It is worth mentioning, however, that parts of the model have
been implemented in a system described in [3]. This system, called Susie Software, was
designed as a prototype dialogue module for an automatic programming system. While the
system's debugged behavior was limited to a twenty-line dialogue (plus some close variants),
it is fair to say that the implementation did nothing to disconfirm the model presented here.
The implementation process did, however, reflect negatively on some earlier versions of the
model, and in that sense it was an important exercise.

8.2. A Look at Related Work in Dialogue Modelling 

Before plunging in, it may be useful to see where the approach to dialogue (and ISA)
recognition that I am advocating fits in with other recent work on dialogue. I start with the
OWL-I representation of actions discussed in Section 4.

OWL-I methods represent chunks of knowledge in the tradition of Minsky's frames
[19] and Schank and Abelson's scripts [27], and as such they should be familiar types of 
structures. Both of these approaches are geared to representing common sense knowledge, 
frames primarily from the point of view of visual knowledge and scripts from the point of 
view of social actions. Although Minsky does discuss the application of frames to language 
understanding, he does not address the special problems posed by dialogue, where language 
generation and recognition are intimately related. Schank and Abelson's focus on social 
actions brings them closer to problems of dialogue, but their representational scheme is quite 
different from the one adopted here. Scripts are built from a small set of primitive actions, 
in contrast to the open-ended set of actions used in the OWL-I methods. Despite the 
differences, the three approaches have one very important point in common. All three focus 
on chunks of knowledge whose interrelationships are used to form expectations to guide 
recognition processing. This idea plays a central role in the framework described here.

Of the specifically language-related research, four efforts are especially relevant here.
The first is work by Grosz [8X12], who focused attention on the importance of task
knowledge for dialogue. A semantic network was used to represent the structure of tasks
such as assembling an air compressor, and dialogue utterances were interpreted with respect
to this task knowledge. Using this approach, Grosz has been able to develop important
insights on reference and pronominalization.

Although Grosz's work shows how task knowledge affects dialogue, there is no
explicit model of more general dialogue knowledge. One early effort to represent both
linguistic and non-linguistic actions uniformly was that of Bruce [5]. The OWL-I dialogue
method is quite close to Bruce's social action paradigm, in that both are centered on
semantic cases and both were developed with an eye to incorporating speech acts. The
primary representational difference is the distinction made in OWL-I methods between
standard and recovery paths (Section 4), which has important implications for processing as
well as for representation.

Bruce's approach, then, points the way to an integration of linguistic and non- 
linguistic knowledge in a single representation scheme. There is still the problem, however, 
of showing how such general representations apply to particular situations. (In [5], Bruce
discusses the use of social action paradigms for story understanding but not for dialogue.) 

A group that has worked on a mechanism for relating representations to individual
dialogues is Moore, Levin, and Mann (see [20]). Their work is specifically dialogue-related,
and their representations for what they call dialogue games are spiritual cousins to Bruce's
social action paradigms and to OWL-I dialogue methods. The major difference is in the
approach to control structure; general dialogue game structures are related to particular
dialogues by a group of processors that work independently and in parallel 061. In contrast,
the design that will be discussed in Section 9 and Section 11 has a highly centralized control
structure where every effort has been made to identify limited sets of information necessary
for each choice. This means that an attempt has been made to limit searches and match
attempts by structuring information so that the system knows where to look for its
information. To an extent, this view is influenced by the current serial machine
architectures, but I suspect that the more structure we can identify in knowledge -- both
linguistic and non-linguistic -- the more successful will be our models.

Another group that has been concerned with both representation and mechanism for
dialogue is Cohen, Perrault, and Allen (Cohen [6], Perrault et al. [22]). They have been
interested in specifying a general, inference-based mechanism for choosing and recognizing
speech acts, given the facts of a situation and the goals of the speaker. Speech act
generation is viewed as a general planning problem, and recognition is viewed as a
plan-recognition problem. This approach differs from more strongly expectation-based
approaches, which include the other efforts described here as well as the one to be
presented.

Within this plan-recognition framework, the work that is of most relevance for us
here is that of Allen [1], who concentrates on ISA recognition. Of the three theories
discussed in Section 5, Allen's framework is closest to that of Searle. The model that Allen
presents uses a general inference mechanism to identify speech act forces beyond what I am
calling here the immediate force. The inference mechanism uses knowledge about the
structure of actions, as well as knowledge about the planning process. As noted above in
Section 6.5, such a general approach is probably necessary for handling some types of ISAs.
My claim, however, is that, for a large class of ISAs, there is knowledge that can be
exploited to avoid deriving a form "from scratch" each time it is recognized. The design
discussed here relies heavily on predigested information associated with ISA patterns. This
allows more aggregated operations, so that one operation may correspond to a segment
containing several inferences in Allen's system.

To sum up, the approach that will be taken gives a central position to expectations in
the recognition process. It recognizes the importance of task knowledge in framing these
expectations, integrating this knowledge with linguistic knowledge in a uniform
representation. Efforts have been made to consolidate control structure where possible, and
an attempt has been made to tailor operations to different categories of ISAs, in order to
exploit knowledge that we have about them.

9. A Framework for Recognition Processing

The recognition process for ISAs cannot be discussed outside of a general model of
recognition. In Section 9.1 I briefly discuss the system configuration I am presupposing, and
in Section 9.2 I go on to recast recognition as a path-building process.

9.1. A System Configuration for Recognition 

In this section, I discuss only those aspects of recognition that are required for the
discussion of ISA recognition. Figure 9.1 shows the recognition processing configuration
that the ISA model assumes.50 Many aspects of the configuration in Figure 9.1 are
influenced by the wish to model dialogue as, essentially, the execution of OWL-I methods.
This means that a knowledge base is needed to hold the method library (among other
information), a knowledge base processor is needed to maintain it, and a method interpreter
is needed to move from step to step and invoke and terminate methods.

Knowledge Base and Knowledge Base Maintenance 

The Knowledge Base contains concepts, the basic unit of OWL-I, and structures built
out of these concepts. These structures include OWL-I methods to represent processing
knowledge as well as descriptive structures for classes of static concepts (e.g., substantives
such as toy blocks or computer consoles). The Knowledge Base Maintenance facility needs
the ability to add new concepts and structures to the Knowledge Base and to modify old
ones.

50. Figure 9.1 has many points in common with the OWL-I implementation, done at the
MIT Laboratory for Computer Science by William A. Martin, Lowell Hawkinson, William
Long, William Swartout, Alexander Sunguroff, and the author. That system contained
running examples of everything in Figure 9.1 except the English Front End.

Figure 9.1. The recognition system assumed.

Interpreter 

It is the Interpreter's job to "execute" method steps to generate and recognize
utterances in the dialogue. If request-and-respond, ask-and-answer, and the other dialogue
methods are speaker independent, then it is up to the interpretation process to determine
whether a particular utterance is to be generated or recognized. There are, in fact, three
possible modes of interpretation51 for a step in a dialogue method:



1. Carry out the step (e.g., ask a question). In dialogue, the most important role of 
Carry-out is to generate utterances, but it is also used to execute "mental" actions. 

2. Recognize that a step has happened (e.g., that an answer to your question has been
given).

3. Assume that a step has happened (e.g., if your conversational partner gave the 
answer, then he or she had to perform the mental process of finding the answer first.) 



A set of simple rules is adequate in most cases to determine the mode of a step from the 
semantic input case settings in the procedure call. 
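
The source does not spell out the mode-selection rules, so the following Python sketch is only one plausible reading of the three examples above. It assumes that the AGENT input case and an "observable" flag are enough to decide the mode. The names Mode, StepCall, and choose_mode are hypothetical.

```python
from dataclasses import dataclass
from enum import Enum


class Mode(Enum):
    CARRY_OUT = "carry-out"
    RECOGNIZE = "recognize"
    ASSUME = "assume"


@dataclass(frozen=True)
class StepCall:
    name: str
    agent: str        # filler of the AGENT input case, e.g. "P1"
    observable: bool  # False for "mental" steps such as finding an answer


def choose_mode(step, me):
    """Pick an interpretation mode for a step from the point of view of `me`."""
    if step.agent == me:
        # Our own steps are carried out, whether utterances or mental actions.
        return Mode.CARRY_OUT
    # The partner's observable steps are recognized; hidden ones are assumed.
    return Mode.RECOGNIZE if step.observable else Mode.ASSUME
```

For the ask-and-answer example with the system as P1, ASK by P1 is carried out, the partner's "find the answer" step is assumed, and the partner's STATE is recognized.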

51. Throughout Part 2 of this paper, the words interpret and interpretation will be used to
indicate the interpretation of methods. The meaning of interpret found in natural language
interpretation (as opposed to natural language generation) will be conveyed by recognize.

52. Semantic input cases were discussed in Section 4. In some conversational settings it is
ambiguous whether, say, recognition or merely assumption should occur next. "Constructed"
dialogue environments can avoid this problem by careful design. In other dialogue
environments, clarifying discussion can often be used to determine the correct mode in an
ambiguous situation.

In order to introduce other system modules, I will expand on the Interpreter's Carry-out
mode. (Recognition mode is discussed further in the next subsection.) Stative steps (e.g.,
results) are handled in Carry-out mode by simply asserting them into the Knowledge Base.
The handling of non-stative method steps is more complex. Given a non-stative step,
Carry-out first evaluates it with respect to the current environment. (Evaluate is discussed
below.) This evaluated step is then treated as a procedure call and matched against known
methods to find appropriate subprocedures. Restrictions on what can fill each input case
associated with a method are used in this matching process. The use of the evaluated call
means that the choice of a method is heavily influenced by the progress of the dialogue to
date.

Evaluate 

OWL-I forms are a combination of regular concepts and variables. The module
Evaluate takes these forms and looks up and binds appropriate values of the variables.
The evaluation process is always done with respect to some environment, for example when
a call is evaluated with respect to the current dialogue environment before the search for a
method to carry it out.
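
As a rough illustration of evaluation followed by method matching, the Python sketch below binds variables in a call form and then filters a method library by input case restrictions. It is not the OWL-I mechanism. Nested tuples standing in for forms, predicates standing in for case restrictions, and the names Var, evaluate, Method, and find_methods are all assumptions made for the example.

```python
from dataclasses import dataclass, field


@dataclass(frozen=True)
class Var:
    """A variable inside a form; bound by looking it up in an environment."""
    name: str


def evaluate(form, env):
    """Return a copy of a nested-tuple form with every Var replaced by its value."""
    if isinstance(form, Var):
        return env[form.name]  # raises KeyError if the variable is unbound
    if isinstance(form, tuple):
        return tuple(evaluate(part, env) for part in form)
    return form  # an ordinary concept, left unchanged


@dataclass
class Method:
    name: str
    action: str
    # Input case name -> predicate the filler must satisfy (hypothetical encoding).
    restrictions: dict = field(default_factory=dict)

    def accepts(self, call):
        """True if the evaluated call names this action and satisfies every restriction."""
        action, case_pairs = call
        cases = dict(case_pairs)
        return action == self.action and all(
            case in cases and allowed(cases[case])
            for case, allowed in self.restrictions.items()
        )


def find_methods(call, library):
    """Match an evaluated call against the library of known methods."""
    return [method for method in library if method.accepts(call)]
```

Because matching runs on the evaluated call, the same step can select different methods as the environment changes, which is the sense in which method choice depends on the progress of the dialogue.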

English Front End and Reference Matcher 

The English Front End consists of morphological and parsing modules for typed
input and includes additional signal processing modules for spoken input. The Reference
Matcher is not limited to resolving noun group references; instead, the entire utterance is
seen as referring to semantic structures, and the task of the Reference Matcher is to identify
these concepts. This identification process can be done by composing new
representations, by verifying that the utterance matches an expected representation, or by
some combination of the two. Note that the arrows in Figures 9.1 and 9.2 are intended to
mark the dominant flow of control. Some interaction with adjacent modules is expected for
both the Front End and Reference Matcher.

For our purposes here, it is the output of the Front End and the Reference Matcher,
rather than their internal operations, which is of interest. The representation that we need
from the Front End will be called surface semantic representation, so named because its
elements are semantic structures but its form corresponds closely to the surface English of
the utterance. The Reference Matcher takes this representation as input and produces as
output Interpreter level representation, the semantic language used by the Interpreter.
Figure 9.2 illustrates the sorts of representations that I am assuming. The lexical items
correspond to OWL-I concepts. Parentheses are used to establish grouping relationships;
they have no relation at all to LISP parentheses. The primary difference between the
representations in Figure 9.2 is that, where possible, referent identification and semantic case
assignments have been made in the Interpreter level representation. Further details of the
representations assumed are given in Appendix E.

Event Tree

The Event Tree is built by the Interpreter as a record of methods executed in the
course of a dialogue. In this regard, the Event Tree can be thought of as intermediate-term
memory, used to record the current dialogue and organised chronologically. Past events are
not removed from the tree, so that they are available for inspection, question answering,
resumption (in the case of uncompleted events), etc.

Can you pass the salt?

Surface Semantic Representation

((SAY INTERROGATIVE)
 (CAN
  ((PASS (SALT THE))
   (SUBJECT YOU))
  (SUBJECT YOU)))

Interpreter Level Representation

[(ASK
  (WHETHER
   [(CAN
     [(PASS SALT-1)
      AGENT: P2
      DESTINATION: P1]
     AGENT: P2]
 AGENT: P1
 DESTINATION: P2]

Figure 9.2. Example representations for an utterance that conveys (only) an ASK.



It is to the branches of the Event Tree that I am referring when I speak of
recognition as a "path-building" process. The branches of the Event Tree correspond to
substeps of methods. Figure 9.3 is a very simple example of an Event Tree configuration
for a computer console session environment in which a question is asked by the user and
answered by the system. Note that each node on a branch either has a subpart relationship
with its superior (e.g., ASK and ask-and-answer) or it is a more specific description than its
superior (e.g., "find out some information" and ask-and-answer). Surface semantic
representations of utterances do not appear as separate nodes on the tree, but they are
instead associated with the speech act node. The association between surface semantic
representation and event node corresponds to the relationship between ISA and speech act.
Note that the example in Figure 9.3 is quite a bit simpler than most. Once a dialogue gets
underway there are typically many levels of subcalls on the Event Tree branches.53
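
The Event Tree just described can be pictured as an ordinary tree of method and speech act nodes, with the surface utterance stored on the speech act node instead of in a node of its own. The Python sketch below is a minimal illustration under that reading. The class EventNode and its fields are hypothetical, and it does not distinguish subpart links from more-specific-description links.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass(eq=False)
class EventNode:
    description: str
    cases: dict = field(default_factory=dict)   # e.g. {"agent": "P2"}
    surface: Optional[str] = None               # utterance attached to a speech act node
    parent: Optional["EventNode"] = field(default=None, repr=False)
    children: list = field(default_factory=list)

    def add(self, child):
        """Attach a child as the next (chronologically latest) substep."""
        child.parent = self
        self.children.append(child)
        return child

    def path_to_root(self):
        """Descriptions from this node up to the root, i.e. one Event Tree path."""
        node, path = self, []
        while node is not None:
            path.append(node.description)
            node = node.parent
        return path
```

Nodes are only ever appended, never removed, which mirrors the requirement that past events stay available for inspection and resumption.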

Two of the speech acts in Figure 9.3 have not been introduced previously. The first,
STATE, is a representative that conveys what P1 believes to be a fact. Quite a bit must be
said in order to adequately define the notion of a "fact," but I appeal instead to the reader's
intuition. The other new speech act, ACKNOWLEDGE, is also a representative that
conveys information about P1's status with respect to the dialogue. It is a sort of marker,
indicating that P1 has completed a step in the dialogue and is ready to continue.

9.2. The Event Tree and Recognition Mode 

This section contains a very condensed outline of the recognition process that I am
advocating, which I have characterized as a "path-building process." Although the
information content of this section is dense, everything relates to a single major point. This
point is that path building is a process that occurs in an environment of incomplete
knowledge, so that it is computationally important in what direction path building is
attempted. I outline here a suggested approach based on a distinction between utterance
types.

53. There is another way in which the treatment described is a simplification. In many
dialogue situations, utterances occur whose motivations are learned only later. These
motivations could have an effect on the choice of some of the higher level nodes on the
Event Tree path of the original utterance. (The level of concern here is the "get some
information" node in Figure 9.3.) The usual options are available to the system builder:
attaching a path to the tree can be deferred until further information is available, the
system can guess and back up, or minimal commitments can be made (e.g., "get some
information" can be chosen temporarily as a motivation). If the last alternative is chosen,
the descriptions on the event path can be refined as more information comes in. For the
dialogue modelling task, this third alternative is probably the most suitable in general. This
approach avoids problems of non-uniform representation that occur when attachments are
delayed, and it avoids the need to undo premature and erroneous commitments that may
occur with a guess-and-back-up strategy.

participate in a console session
  agent: P1 (the computer system)
  co-agent: P2 (the user)

    get some information

      ask-and-answer a question

        ASK a question
          agent: P2
          destination: P1
          "What time is it?"

        decide whether to answer
          agent: P1

        find the answer
          agent: P1

        state-and-acknowledge the answer
          agent: P1
          co-agent: P2

          STATE the answer
            agent: P1
            destination: P2
            "1:05 A.M."

          ACKNOWLEDGE the answer
            agent: P2
            destination: P1
            "Thanks."

Figure 9.3. An Event Tree configuration with paths for a question/answer exchange.

Recall that methods are being used to model a task-oriented dialogue from the point
of view of one of the participants (say, P1), so that some method steps correspond to all or
part of the underlying form of generated utterances and some provide underlying forms for
utterances that are recognized. Once P1 uses a method step to generate an utterance, we can
identify a set of patterns that are highly likely as underlying forms of the response by the
other participant, P2, given that the dialogue continues along the same lines. This set of
patterns includes the possible active successor steps to the generated step, the lead-ins (i.e.,
the first step executed in recognition mode) to recovery paths related to the generated step, or
lead-ins to recovery paths related to its successor steps. (Recovery paths were introduced in
Section 4.) Here, active is used to rule out strictly conceptual steps such as thinking of an
answer. Since P2 need not respond only to P1's goals, I include two other types of pattern,
initiator and metadiscussion. Initiators are utterances that start off new, independent tasks
(as opposed to substeps of tasks currently underway). Metadiscussion utterances are about
the dialogue rather than strictly step-derived. They tend to clarify or change the flow of
the dialogue. (For more on these different types of utterances, see Brown [4].) The different
patterns corresponding to the different types of utterances will be called structural
expectations.54 Structural expectations developed from the current configuration of the
Event Tree can be viewed as "on deck," waiting to participate in the matching process.
When a match against a structural expectation is successful, a path of dialogue events is
created, linking nodes for utterances to expectations. This path is attached as a branch on
the Event Tree (of this, more below).

54. The term structure is used here for what others might call the syntax of dialogue. Many
"structural" phenomena are semantic in flavor (although they do not necessarily vary
according to the specific semantic domain), and the use of the term syntax might be
misleading. For example, the fact that questions get answers can be called "dialogue
syntactic;" however, the fact that answers may have associated stipulations ("Yes, if ...") or
qualifications ("Yes, but ...") begins to stretch the term syntactic. The expectations generated
from methods and the general task environment are therefore referred to as structural.
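
To make the idea of expectations waiting "on deck" concrete, the Python sketch below matches a recognized speech act against a list of structural expectations. It is a simplification made for illustration: the four expectation kinds follow the text, but matching on speech act, agent, and an optional content label is an assumption, as are the names Kind, Expectation, Recognized, and first_match.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Kind(Enum):
    SUCCESSOR = "active successor step"
    RECOVERY_LEAD_IN = "lead-in to a recovery path"
    INITIATOR = "initiator"
    METADISCUSSION = "metadiscussion"


@dataclass(frozen=True)
class Expectation:
    kind: Kind
    speech_act: str
    agent: str
    content: Optional[str] = None  # None means any content is acceptable


@dataclass(frozen=True)
class Recognized:
    speech_act: str
    agent: str
    content: str


def first_match(utterance, on_deck):
    """Return the first on-deck expectation the utterance satisfies, or None."""
    for expectation in on_deck:
        if (expectation.speech_act, expectation.agent) != (utterance.speech_act, utterance.agent):
            continue
        if expectation.content is None or expectation.content == utterance.content:
            return expectation
    return None
```

Listing the more specific recovery lead-in before the general successor step lets "I don't know" be caught by the recovery path while any other STATE falls through to the standard path.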

Figure 9.4 illustrates the notion of a structural expectation, again with the simple
example of a question asked in a console session environment. This time, however, it is the
system that asks the question. If all proceeds according to the standard path of the
ask-and-answer method, the user will first find the answer (an assumed step from the point
of view of the system). As the ask-and-answer method is written now, the next step is a call to
state-and-acknowledge the answer. Thus, state-and-acknowledge is our expectation, although
the Interpreter must go through a layer of calls and method searches to find the speech act
expectation STATE. It is the STATE step that is at the same level of aggregation as the
utterance.55 Since the user may decide not to answer the question, other structural
expectations are possible, among them lead-ins to recovery paths associated with
ask-and-answer, lead-ins to general recovery methods, initiators, and metadiscussion. Figure 9.4
shows one such possibility, a STATE that is the lead-in step to the recovery path that says
that P2 does not know the answer. (This STATE is not represented as a substep of
state-and-acknowledge, because the expected responses are different.)



55. Additional mechanism, however, is needed to bridge the gap between the speech act (e.g.,
STATE) and the utterance. The sections that follow consider the mechanism necessary to
bridge this gap for ISAs.



EVENT TREE:

participate in a console session
  agent: P1 (the computer system)
  co-agent: P2 (the user)

    get some information

      ask-and-answer a question

        ASK a question
          agent: P1
          destination: P2
          "What time is it?"

        <POSSIBLE CONTINUATION POINT>

STRUCTURAL EXPECTATIONS:

state-and-acknowledge the answer
  agent: P2
  co-agent: P1

    STATE the answer
      agent: P2
      destination: P1

STATE that P2 doesn't know the answer
  agent: P2
  destination: P1

Figure 9.4. An Event Tree configuration with a structural expectation.

Note that although it was straightforward in the example to get from
"state-and-acknowledge the answer" to the speech act expectation STATE, in general there may be a
number of intermediate nodes between an expectation from the standard path of a method
and the possible speech act step(s). Each node on the Event Tree can be a choice point,
where one of several methods is chosen to match the call. Since the goals of the agent can
influence the choice of method, recognition of an utterance, i.e., constructing the appropriate
Event Tree path, is generally done in an environment of incomplete knowledge. It is not
immediately clear, therefore, whether a path should be built top-down (from most aggregate
expectation to speech act expectation), bottom-up, or using some hybrid approach.

The answer to this problem suggested in [3] is to use different path-building
strategies for different basic utterance types. Recognition procedures for initiators,
metadiscussion, and the more general variety of recovery discussion, which tend, particularly
in task-oriented dialogue, to be only weakly determined by context, can depend heavily on
the utterance itself. In terms of the Event Tree, this means that paths are built essentially
bottom-up. Recognition procedures for standard path successor steps and recovery
discussion that is closely tied to the structure of the dialogue can be driven by the most
aggregate structural expectations (evaluated with respect to the current environment), since
in this case the expectations will tend to embody the better information. For these types of
utterance, Event Tree path construction can proceed in a generally top-down direction.
Using such a mixed strategy, the recognition process can take advantage of the context
supplied by the method representation and use different, more utterance-centered strategies
when strong contextual information is not available.
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10. General Observatio ns about ISA Recognition 



Given the general process model for recognition outlined in the last section, I can now 
discuss issues related to ISA recognition. In the first two subsections I consider the 
implications of the recognition framework for ISA processing. The final two subsections 
discuss the problem of ambiguity for ISAs. 

10.1. The Role of the Recognition Framework 

The structural expectations and the basic utterance types introduced in Section 9 can
play a role in the recognition of ISAs. Within this framework, most of the examples used so
far can be thought of as initiators. For example, "Can you pass the salt?" can initiate an
independent task associated with eating a meal. Not all ISAs, however, occur as initiators.
Consider, for example, the process of paying for merchandise with a credit card. If, some
way along in the process, the salesperson utters 10.1, this is typically a REQUEST, not
merely a representative.

10.1 I need your signature. 

I claim that a good way for a process model to determine the correct speech act is to use the
fact that the pay-with-credit-card method will contain a step to the effect that the
salesperson REQUESTS the customer to sign the credit slip. Using this structural
expectation, the Interpreter knows that it is looking for a REQUEST force (possibly among
other alternatives). It can then ask whether the incoming utterance can be construed as a
REQUEST. More precisely, to recognize an utterance as a standard path or recovery path
successor step or as a lead-in to a recovery path, the Interpreter can do what is necessary to
find the speech act(s) (i.e., by hypothesizing possible Event Tree paths top-down from
evaluated method steps), find the ISA forms associated with this speech act (either stored
explicitly or in the form of general rules), and ask whether the utterance matches any of
these forms.

In contrast to standard and recovery paths, context is generally weaker for initiators,
general recovery discussion, and metadiscussion. 57 In these cases, I have proposed that the
Event Tree path(s) be grown bottom-up. For ISAs, this means starting with an ISA form
and asking what speech acts it could convey.

In thinking about ISA representations and the details of recognition, then, I will be
considering both directions: ISA form to speech act (henceforth abbreviated as ISA -> SA)
and speech act to ISA form (abbreviated as SA -> ISA). This bi-directional approach
follows from the choices made in the general recognition model. Since both directions are
covered, it also means that the discussion in the following sections should be translatable to
dialogue process models that are similar to the one proposed here but that use a different mix
of bottom-up and top-down strategies to relate utterances to expectations.
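
The assignment of basic utterance types to the two directions can be summarized in a small sketch. The Python below is only an illustration under stated assumptions: the names `UtteranceType`, `Direction`, and `path_direction` are hypothetical labels introduced here and are not part of the model's notation. The mapping itself follows the text: weakly constrained types are handled bottom-up (ISA -> SA), and types tied to the structure of the dialogue are handled top-down (SA -> ISA).

```python
from enum import Enum


class UtteranceType(Enum):
    """Basic utterance types named in the text (labels are illustrative)."""
    INITIATOR = "initiator"
    METADISCUSSION = "metadiscussion"
    GENERAL_RECOVERY = "general recovery discussion"
    STANDARD_SUCCESSOR = "standard path successor step"
    RECOVERY_SUCCESSOR = "recovery path successor step"
    RECOVERY_LEAD_IN = "lead-in to a recovery path"


class Direction(Enum):
    SA_TO_ISA = "top-down"    # start from expected speech acts
    ISA_TO_SA = "bottom-up"   # start from the ISA form of the utterance


# Types for which context is weak, so the utterance itself drives recognition.
WEAK_CONTEXT = {
    UtteranceType.INITIATOR,
    UtteranceType.METADISCUSSION,
    UtteranceType.GENERAL_RECOVERY,
}


def path_direction(kind: UtteranceType) -> Direction:
    """Return the path-building direction hypothesized for an utterance type."""
    return Direction.ISA_TO_SA if kind in WEAK_CONTEXT else Direction.SA_TO_ISA
```

Since the type of an incoming utterance is not known in advance, a recognizer would presumably call such a function once per hypothesis about the utterance type rather than once per utterance.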



56. Here and elsewhere, when I talk in terms of utterances matching ISA forms, I really
mean a match between the surface semantic representation of the utterance and the
transitional representation of the ISA form. See Appendix E for a discussion of these
representations.

57. It is not, however, clear to what extent metadiscussion uses ISAs, since these utterances
tend to fall into rather fixed patterns.

10.2. ISAs and the Event Tree

In order to represent ISAs, one Event Tree path is built for each speech act conveyed
by an utterance. 58 Given this approach, example 5.19 ("It's cold in here" uttered as a
REQUEST to close the door) would have the Event Tree informally represented in Figure
10.1.

            <header of the highest-level method
             for this task-oriented dialogue>
                /                      \
    give information              get some task done
           |                              |
    state-and-acknowledge         request-and-respond for
    the information               the task
           |                              |
    STATE that                    REQUEST that
    the room is cold              P1 close the door
                \                      /
            surface semantic representation of
            "It's cold in here"

Figure 10.1. A possible Event Tree configuration for Example 5.19.



58. An utterance may, of course, suggest more than one possible set of Event Tree paths,
and the choice between them may not be obvious. This is a problem of ambiguity, which
will be discussed in the next two subsections.

Recall that the surface semantic representation of the utterance is associated with speech act
nodes but is not itself an actual node on the Event Tree.

Despite the multiple path representation given to some utterances, ISAs in
task-oriented dialogue are typically motivated by a single primary hierarchy of goals and
subgoals held by P1. That is, in the usual case, one of the speech acts conveyed reflects the
primary motivation behind the utterance. 59 Moreover, the speech act force that is the most
important will be the final one, i.e., secondary for utterances with immediate and secondary
forces, tertiary for a three-force utterance. For the "It's cold in here" example, the primary
goal hierarchy would include getting the room warmer by getting the door shut, and thus
REQUEST can be thought of as the more important of the two speech acts conveyed.

The observations in the last paragraph have two implications for the general
recognition mechanism that has been outlined. First, in those cases where I advocated
top-down processing (Section 9.2), if an utterance by P1 conveys multiple forces, only one of the
forces will match a structural expectation related to the current configuration of the Event
Tree. This force can be thought of as conveying the primary goal of P1. Second, the force
that matches the expectation will be the final one. Figure 10.2 illustrates the type of Event
Tree configuration that I am assuming. Figures 10.3 and 10.4 illustrate cases that I am
assuming to be non-standard in task-oriented dialogue, because they contradict the first and
second assumptions, respectively.



59. I emphasize that this restriction applies to the usual case. Exceptions are, of course,
possible, but I would expect to see additional mechanism called in to handle them.

buy merchandise with credit card
    agent: P1 (the buyer)
    co-agent: P2 (the salesclerk)
  |
  <intervening substeps>
    |
    fill out credit slip
        agent: P2
        co-agent: P1
      |
      request-and-respond
      for P1 to sign the credit slip
                                  |
    STATE that                    REQUEST that
    P2 needs P1 to sign           P1 sign the credit slip
    the credit slip
                \                      /
            surface semantic representation of
            "I need your signature."

Figure 10.2. Event Tree paths for a two-force ISA.




In Figure 10.2, note that the STATE and REQUEST nodes would be marked with
the same time and surface semantic representation, so there would be no chance of
construing them as totally independent events.



    TASK 1                  TASK 2
      |                       |
    SPEECH ACT 1            SPEECH ACT 2
      |                       |
    IMMEDIATE FORCE         SECONDARY FORCE



Figure 10.3. A questionable Event Tree configuration for an ISA 

Figure 10.3 is only skeletal, because I have not found examples of utterances that relate
to two independent tasks (or independent parts of the same task) by a split in their speech
act forces.

Finally, Figure 10.4 says that one can answer a simple question about the temperature
of the room with an utterance that is, secondarily, an indirect REQUEST. I claim that
Figure 10.4 is an ill-formed Event Tree, if it is taken to represent one of the ISA types
discussed in this paper. Note that the assumption that only the final force can match a
structural expectation related to the Event Tree implies that, once such a match is found,
there is no need to derive further indirect forces.

In summary, Event Tree ISA representations involve one path per speech act force.
For top-down recognition processing, however, only one path matches a structural
expectation related to the Event Tree: the path generated for the final speech act force.

ask-and-answer
  |
  +-- ASK
  |     what the temperature of the room is
  |     on a scale of cold, medium, and hot.
  |       agent: P2
  |       destination: P1
  |
  +-- find the answer
  |       agent: P1
  |
  +-- state-and-acknowledge the answer
          agent: P1
          co-agent: P2
        |
    STATE that                    REQUEST that
    the room is cold              P1 close the door
                \                      /
            surface semantic representation of
            "It's cold in here."

Figure 10.4. Another questionable Event Tree configuration for an ISA.

10.3. Some Remarks about ISA Ambiguity 

The last two subsections glossed over the problems of ambiguity, and I would like to
remedy that situation now. In the recognition framework described in Section 9,
ambiguities with respect to speech act force occur when an utterance matches more than one
structural expectation. A choice must usually be made as to which expectation was
intended. In this subsection, I will make some general observations about the process of
making this choice. Three points will be discussed:

1. the relationship of disambiguation reasoning and reasoning done for other 
purposes 

2. the effect that a dialogue environment has on the disambiguation process 

3. systematic versus nonsystematic ambiguity between the ISAs conveyed 60

60. Searle in [30] has questioned the use of the term ambiguity to refer to differences in the
forces conveyed by indirect forms. Since the notion of meaning that I am using (see Section
7.3) is broader than Searle's, and since differences in conveyed forces amount to differences
in meaning as I construe it, I will continue to use disambiguation to mean choices among the
possible ISA forces of an utterance.

Starting with the relationship of disambiguation reasoning to other sorts of reasoning,
the most important point is that disambiguation reasoning is reasoning about the world
model of the conversational partner. For example, if P1 makes an utterance to P2, any
reasoning that P2 does about the need for an action to be done, etc., is done with respect to
what P2 believes about P1's world model. This is not necessarily the same as P2's model of
the world (or, for that matter, as P1's; this is a common source of error). This said, I will not
say much more about the divergence of world models. Recent work in this area is that of
Cohen [6]. To justify limiting the scope of this paper, I appeal to what appears to be the
default assumption:

If there is no information to the contrary, assume that the dialogue partner's world
model is the same as your own.

So, although I will note places where dual world models become a factor, I will assume for
the purposes of exposition that there is "no information to the contrary."

Reasoning for disambiguation, then, does not always use the same knowledge sources
as general reasoning. A further point about disambiguation reasoning is that it can be
quite complex. Where REQUEST is a possibility as a conveyed speech act, it may be
necessary for P2 to evaluate capabilities, willingness, authority relationships, schedules of
activities, etc. This sort of evaluation can be an expensive process. Note, however, that the
same information needed to disambiguate is often also needed to decide whether to comply
with a directive or to determine whether information conveyed in a representative was
previously known. Thus, a mechanism that can save its reasoning processes (or at least
save those steps and results known to be of potential interest later on) can avoid
duplicating what appear to be inherently expensive processes.
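
One simple way to picture the saving of reasoning results is a cache keyed by the condition that was evaluated. The sketch below is a minimal illustration, assuming that each evaluation can be identified by a (condition, action) pair. The class `ReasoningCache`, the function `toy_evaluate`, and the condition strings are hypothetical stand-ins introduced here, not part of the proposed mechanism.

```python
from typing import Callable, Dict, Tuple


class ReasoningCache:
    """Remember the outcome of expensive evaluations so they run only once."""

    def __init__(self, evaluate: Callable[[str, str], bool]) -> None:
        self._evaluate = evaluate
        self._results: Dict[Tuple[str, str], bool] = {}
        self.evaluations = 0  # how many times the expensive check actually ran

    def holds(self, condition: str, action: str) -> bool:
        key = (condition, action)
        if key not in self._results:
            self.evaluations += 1
            self._results[key] = self._evaluate(condition, action)
        return self._results[key]


def toy_evaluate(condition: str, action: str) -> bool:
    """Stand-in for an expensive check against P2's model of P1's world."""
    facts = {
        ("P2 is able", "close the door"),
        ("P2 is willing", "close the door"),
    }
    return (condition, action) in facts
```

With such a cache, a capability check made while disambiguating an utterance can be reused later when P2 decides whether to comply: the second call to `holds` with the same arguments returns the stored result without re-running the evaluation.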

This brings me to my second point, which relates to the effect of the dialogue
environment on the disambiguation process. One important aspect of dialogue is the fact
that P1 is available to aid P2 in disambiguating P1's utterances. Dialogue participants quite
commonly ask for clarification of intent, e.g., 10.2 to a "Do you know...?" question or 10.3 to a
"Can you <action>?" form:

10.2 Are you asking me or telling me?

10.3 Yes. Do you want me to?

Another frequently used strategy is for P2 to make a disambiguating choice clear in a
response. An example here is 10.4 in response to a "Can you <action>?" form interpreted as
a REQUEST:

10.4 Sure, I'll get started right away.

One result of this sort of response is that P1 can go on to alert P2 if P2's interpretation was
not the one that P1 intended.

The dialogue environment, then, provides additional opportunities for acquiring
disambiguation information and, if responses are constructed carefully, it provides checks to
prevent misunderstandings. 61 Disambiguation in task-oriented dialogue may be difficult,
but the options available effectively limit how hard a dialogue participant must work.

61. For a systematic approach to differences among different linguistic modalities, see Rubin
[23].

We turn now to the third topic of this subsection, the distinction between systematic
and nonsystematic ambiguity in the speech acts conveyed by an utterance. A systematic
ambiguity for ISAs occurs when all forms within an ISA subset display a particular
ambiguity with respect to the speech act conveyed. Systematic ambiguities for ISAs include
ambiguities that occur between direct and indirect immediate forces of an utterance. This
type of ambiguity is common to all ISAs that do not contain special identifying
characteristics (e.g., some frozen ISA forms as described in Section 7.4). A familiar example
is whether 5.3 has ASK or REQUEST as its immediate force.

5.3 Can you close the door?

Another type of systematic ISA ambiguity is whether or not a given secondary force applies. 
This type of ambiguity is common to all ISA forms which may convey a secondary force, 
and the familiar example is whether 5.4 conveys a REQUEST in addition to the immediate 
ASK force. 

5.4 Are you able to close the door? 

Given what has been said, nonsystematic ambiguities are ambiguities that are not
shared by other similar forms. The occurrence of nonsystematic ambiguity is more
fortuitous. Such an ambiguity typically arises from a special configuration of structural
expectations, e.g., for standard path successor steps. Either because of lexical ambiguity
within the utterance or because the expectations are similar to each other, an utterance
matches more than one expectation.

In discussing aspects of the process model, I will be concerned only with systematic
ISA ambiguity. For this type of ambiguity, the structural expectations that match the 
utterance are all of the same basic type; that is, the utterance has several possible 
interpretations as an initiator, or several as a standard path successor step, etc. Ambiguities 
in which possible structural expectations are drawn from different basic types are 
nonsystematic. 

Because systematic ISA ambiguities are predictable, it is possible to exploit special
knowledge about them and tailor special disambiguation mechanisms to them. For
nonsystematic ambiguity, it appears that the best tools we have available are general
heuristics and a general disambiguation mechanism. 62 The mechanism is "general" in the
sense that it is not keyed to particular ISA ambiguities. In this paper, I restrict myself to
systematic ISA ambiguity; the next section discusses the application of special knowledge
associated with ISA types. Before that, however, it is useful to take a general look at the
dependencies that must be taken into account in the disambiguation process.

10.4. Dependencies in ISA Disambiguation 

A computational approach to ISA disambiguation has several inherent problems. For
those utterances with only an immediate speech act force, the recognition process will not
necessarily be simpler than for utterances with more than one speech act force. For example,
"Are you able to open the door?" could have the immediate force ASK only, with no
secondary force. For this type of utterance, it will often be necessary to consider the
possibility of a secondary force in order to rule it out.

Also, for utterances where an action could be implicit, it is often necessary to
determine whether P1 intends an action (and what that action is) as part of the process of
determining the immediate or secondary force. For example, 10.5 could have the immediate
force STATE, or, say, if P1 were blackmailing P2, it could have an immediate REQUEST
force.

10.5 I want a million dollars. 



62. In [3] I discuss the use of ordering of match attempts as one part of a mechanism for
handling nonsystematic ambiguity in task-oriented dialogue.

It is necessary to determine the likely implicit action (in this example, that P2 give P1 the
money) in order to decide whether this particular action can be REQUESTed in this
environment.

On the other hand, for most (and possibly all) ISAs that do have a secondary force,
determining the immediate force can be done independently of the secondary force. This is
because, as noted in Section 7.3, most ISA classes considered here are derived by only one
level of indirection. 63 We thus do not get the case illustrated in Figure 10.5, where a
surface form leads to two different immediate forces with two different associated secondary
forces, the latter of which must be considered in determining which of the former apply.
Instead, the case that obtains is Figure 10.6. The computational implication of Figures 10.5
and 10.6 is that immediate forces can be determined from the utterance and from context,
independent of any consideration of the nature of the secondary forces that might apply. In
the process model, we can exploit this independence of choice of immediate force where it
exists, with the normal gains in simplicity and modularity that come from decoupling
processes.
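
The decoupling described above can be pictured as two stages, the first of which never consults the second. The following is a minimal sketch under stated assumptions: the function names, the context keys, and the toy rules inside each function are hypothetical and serve only to show the shape of the dependency. The example utterance and its two possible secondary forces are those of Figure 10.6.

```python
from typing import Dict, Optional, Tuple

Context = Dict[str, bool]


def immediate_force(utterance: str, context: Context) -> str:
    """Stage 1: chosen without reference to any secondary force.

    Toy rule: this stand-in looks only at the final punctuation and
    ignores the context argument it is given.
    """
    return "ASK" if utterance.rstrip().endswith("?") else "STATE"


def secondary_force(immediate: str, utterance: str,
                    context: Context) -> Optional[str]:
    """Stage 2: choose a secondary force, taking the immediate force as fixed."""
    if immediate == "STATE" and "weeds" in utterance:
        if context.get("weeding_is_P2s_job", False):
            return "REQUEST"
        if context.get("P2_is_deciding_what_to_do", False):
            return "SUGGEST"
    return None  # immediate force only


def recognize(utterance: str, context: Context) -> Tuple[str, Optional[str]]:
    first = immediate_force(utterance, context)
    return first, secondary_force(first, utterance, context)
```

Because `immediate_force` takes no secondary force as input, it can be replaced or tested in isolation, which is the gain in modularity that the paragraph above points to.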



63. Among the ISA forms that I am considering, the only possible exception to this is the
type 3 implicit-action form. Even in this case, however, we would have to have a very
specialized type of form in order for determination of immediate force to be dependent on
secondary force. We would have to be dealing with what I am calling a frozen ISA form
(see Section 7.4). The immediate force would not correspond directly to surface form, and
there would only be a secondary, not a tertiary, speech act force. Although all of this is
theoretically possible, I have not found any examples in the transcripts that I have
examined.

                 +-- Immediate Force-1 + Secondary Force-1
    Utterance ---|
                 +-- Immediate Force-2 + Secondary Force-2

Figure 10.5. A type of ambiguity in speech act force that probably
does not occur.

                 +-- Immediate Force-1 + Secondary Force-1
    Utterance ---+-- Immediate Force-1 + Secondary Force-2
                 +-- Immediate Force-1

e.g.,

    Utterance = "The garden is full of weeds."
    Immediate Force-1 = a STATE that the garden is full of weeds
    Secondary Force-1 = a REQUEST that P2 weed the garden
                        (e.g., in a context where this is P2's job)
    Secondary Force-2 = a SUGGEST that P2 weed the garden
                        (e.g., in a context where P2 is deciding what to do)

Figure 10.6. The common type of ambiguity in speech act force.

11. Outline of a Mechanism for ISA Recognition

Given the framework for ISA recognition discussed in Sections 10.1 and 10.2, along
with the observations about ISA disambiguation from 10.3 and 10.4, we can now examine
the mechanism for ISA recognition in more detail. This section starts with a discussion of
the application of the rules from Section 6. Section 11.2 considers a mechanism for
recognizing frozen ISA forms, and Section 11.3 looks at the recognition of implicit-action
ISAs.

11.1. Applying the Rules for Explicit-Action Forms 

Recall that the ISA rules in Section 6 were based on properties of speech acts as a
class, preconditions of individual speech acts, and (for some action-centered speech acts) the
semantic input cases of the actions. In this subsection, I outline a mechanism for
recognizing rule-based ISAs, splitting the discussion into the two directions identified in
Section 10, SA -> ISA and ISA -> SA.

SA -> ISA is the easier direction to describe for these ISA rules. We start with the set
of structural expectations that are to be matched top-down (Section 9.2). (Whether the
match process should proceed sequentially or in parallel is an open question that will not be
considered here.) Focussing on one speech act from this set of expectations, we can obtain
the preconditions of the individual speech act by following the links in the method structure
described in Section 4. It is also straightforward to get from an action named in an
utterance to an appropriate set of semantic input case specifications (or, given ambiguity, a
group of such sets). Given these semantic structures, we can apply ISA rules 6.3 to 6.10 to
yield patterns particular to the speech act. (Alternatively, these ISA patterns can be
pre-generated and associated with the speech act.) The ISA patterns associated with the speech
act, along with patterns from Rules 6.1 and 6.2, which are applicable to speech acts in
general, can then be evaluated with respect to the current dialogue environment. One of
the major effects of the evaluation process is to bind representations for the appropriate
dialogue participants to semantic case variables in the pattern. The surface semantic
representation of the utterance can then be matched against each of these evaluated
patterns.
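
The sequence just described (collect the patterns for an expected speech act, evaluate them by binding case variables, then match) can be sketched as follows. This is an illustration only, under strong simplifying assumptions: patterns and surface semantic representations are flattened to tuples of strings, variables are tokens beginning with "?", and the two REQUEST patterns are hand-written stand-ins rather than the output of the rules in Section 6. All names in the code are hypothetical.

```python
from typing import Dict, List, Tuple

# A pattern is a tuple of tokens; tokens starting with "?" are case variables.
Pattern = Tuple[str, ...]

# Hand-written stand-ins for patterns that ISA rules might yield (assumption).
ISA_PATTERNS: Dict[str, List[Pattern]] = {
    "REQUEST": [
        ("ASK", "whether", "?destination", "can", "?action"),
        ("STATE", "that", "?agent", "wants", "?destination", "to", "?action"),
    ],
}


def evaluate(pattern: Pattern, bindings: Dict[str, str]) -> Pattern:
    """Bind known case variables; unbound variables are left in place."""
    return tuple(
        bindings.get(tok[1:], tok) if tok.startswith("?") else tok
        for tok in pattern
    )


def matches(evaluated: Pattern, surface: Pattern) -> bool:
    """Token-wise match; a remaining variable matches any single token."""
    if len(evaluated) != len(surface):
        return False
    return all(p.startswith("?") or p == s for p, s in zip(evaluated, surface))


def match_expectations(expected_acts: List[str], surface: Pattern,
                       bindings: Dict[str, str]) -> List[str]:
    """Return the expected speech acts having at least one matching pattern."""
    hits = []
    for act in expected_acts:
        for pattern in ISA_PATTERNS.get(act, []):
            if matches(evaluate(pattern, bindings), surface):
                hits.append(act)
                break
    return hits
```

For instance, with the participants bound as `{"agent": "P1", "destination": "P2"}`, the surface form `("ASK", "whether", "P2", "can", "close the door")` matches the first REQUEST pattern, while the same form addressed to a different destination does not.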

In the SA -> ISA direction we encounter predominantly nonsystematic ambiguities
among speech act forces. Therefore, the type of general disambiguation mechanism chosen
determines how the successful matches are treated (or whether match attempts continue at
all beyond an initial success).

The ISA -> SA direction for the general rules is more complicated. When the
recognition mechanism is working top-down, the method structure and associated
relationships conveniently group the representations on which the ISA patterns are based.
Working bottom-up, we must contend with the systematic ambiguity discussed in the last
section. This means that for almost every utterance we will have choices between speech
acts conveyed, choices whether a given speech act force applies, or both. This introduces
several problems.

First, at least for a serial matching mechanism, in the worst case every different
speech act might have to be checked to see if the utterance in hand matches any of the
possible indirect forms for that speech act. Also, independent of the matching
implementation chosen, deciding which speech act forces were intended by P1 can involve
very complex analysis, particularly if action-centered speech acts are among the possible
forces.

To cope with these difficulties, we can exploit two sources of information: knowledge
about which speech act forms can be used to stand for others as ISAs and predictions of the
type of knowledge that will be necessary to distinguish between competing speech act forces.

To use this knowledge, I propose a two-pass mechanism consisting of initial screening
followed by analysis. Initial screening involves a match of an utterance against special entry
patterns. These patterns are abstracted, if necessary, from patterns produced by the
application of ISA rules such as those in Section 6 to appropriate semantic structures (e.g.,
preconditions). The point of the abstracting process is to produce entry patterns for which
the match will involve only context-independent properties. This means that a match of an
entry pattern involves only information explicit in the utterance or implicit information that
can be derived independent of context. The authority relationship between P1 and P2, for
example, is a property that would not be appealed to as part of the initial match. 64

64. The abstraction process produces patterns that are as specific as possible while still
permitting a context-free match. Abstraction is done on the variables of the ISA patterns
only (see Appendix B).

The abstracted entry patterns have associated with them analysis procedures to
discriminate between alternative interpretations. Once an entry pattern is matched (due to
the abstraction process there will be only one success), the associated discrimination
procedure is executed. Discrimination procedures examine properties of the dialogue
context to determine which speech act forces were intended by P1. These procedures are
written to incorporate the specialized knowledge that we have about individual systematic
ambiguities.
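
The overall shape of the two-pass mechanism can be sketched in a few lines. The Python below is a minimal illustration, not a specification: the class `EntryPattern`, the function `recognize`, the string test used for screening, and the context flag `action_within_P2_obligation` are all hypothetical names and simplifications introduced here. The point shown is only that the first pass sees the utterance alone, while the second pass is allowed to consult the dialogue context.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

Context = Dict[str, bool]


@dataclass
class EntryPattern:
    """An abstracted entry pattern with its discrimination procedure."""
    name: str
    screen: Callable[[str], bool]                       # pass 1: context-free
    discriminate: Callable[[str, Context], List[str]]   # pass 2: uses context


def recognize(utterance: str, context: Context,
              entry_patterns: List[EntryPattern]) -> Optional[List[str]]:
    """Screen the utterance, then run the matched pattern's procedure."""
    for entry in entry_patterns:
        if entry.screen(utterance):
            # The text assumes at most one entry pattern succeeds.
            return entry.discriminate(utterance, context)
    return None  # no entry pattern matched


def _can_you_discriminate(utterance: str, context: Context) -> List[str]:
    # Toy test: treat the form as a REQUEST only if the action falls
    # within an obligation of P2 to P1 (a hypothetical context flag).
    if context.get("action_within_P2_obligation", False):
        return ["REQUEST"]
    return ["ASK"]


CAN_YOU = EntryPattern(
    name="Can you <action>?",
    screen=lambda u: u.lower().startswith("can you "),
    discriminate=_can_you_discriminate,
)
```

A real entry pattern would be matched against the surface semantic representation rather than against the raw string, and its discrimination procedure would apply several tests rather than one.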

For an example of the type of processing I have in mind, consider the following uses 
of a "Do you want..." form. (Other uses occur, but these four will do for an illustration.) 

ASK + OFFER
    one friend to another:
    Do you want to take my car?

ASK + REQUEST
    one roommate to another:
    Do you want to answer the phone?

ASK + SUGGEST
    parent to child:
    Do you want to try to build an arch?

ASK
    one student to another:
    Do you want to go back to school?

Any of these examples will pass an initial match of a representation informally conveyed by:

    interrogative: whether P2 wants nonverbal action
                   or verbal action whose destination is not P1
    except for: 1. actions involving joint agency of P1 and P2
                2. the action of remembering a previous REQUEST



65. This pattern illustrates the abstraction process. Since abstraction only happens for
variables, only the representation for the action desired is affected. For OFFER, this action
is "some action that complements P1's part of the action" (OFFER IV). For REQUEST, the
action is "a non-verbal action or a verbal action whose destination is not P1, except for the
action of remembering a previous REQUEST." For SUGGEST, actions with joint agency
of P1 and P2 are excluded, while for ASK there is no restriction. To produce the entry
pattern, the restriction on the action in OFFER is ignored, because context-dependent
information might be required to verify it. This leaves SUGGEST and REQUEST with
restricted actions. An entry pattern is then formed using both of these restrictions.

Once any of the four types of utterance above has passed the initial screening, we need a
procedure to determine the correct speech act forces. To discriminate between the various
speech act possibilities, I suggest the tests listed below, applied as shown in Figure 11.1. 66

Tests for REQUEST:

1. Does the action named fall within some authority, complementary role, or "be helpful"
obligation of P2 to P1? (Recall the discussion of these obligations in Section 3.)

2. Would one generally expect that P2 could successfully take responsibility for the action
named?

The ASK Block:

1. Is there a reason why P1 would want to know the literal interpretation of the utterance
without necessarily wanting to set in motion the action named?

2. Does the context preclude setting the action named in motion? This would be the case if
the context is not a normal one for the action and there is no reason to assume that the
standard conditions have been overridden.

Tests for OFFER:

1. Would one generally expect the primary benefit of the action named to be to P2 in this
context?

2. Is P1 to be an active participant (i.e., AGENT or CO-AGENT) in the action?

3. Does it look like P1 can successfully take responsibility for an action that complements the
action named (e.g., "lend a car" for "take a car")? One issue involved here is whether any
objects of a transfer fall under P1's ownership, control, etc.



66. Recall from Section 10.3 that these tests are to be done with respect to the world model
that P2 believes that P1 is using.

[Flowchart with YES/NO branches. Decision nodes, in the order they appear:
    Is P1 the AGENT of the action named?
    Does the utterance pass the tests for OFFER?
    Does the utterance pass the tests for REQUEST?
    Does the utterance pass the tests for SUGGEST?
Each of the last three is followed by the decision "Does the utterance pass the ASK block?"
Outcomes: ASK + OFFER, ASK + REQUEST, ASK + SUGGEST, ASK.]

Figure 11.1. General organization for discriminating between four
interpretations of the form "Do you want <action>?"

Tests for SUGGEST:

1. For each equal authority relationship between P1 and P2, can the action named be
construed as falling within the relationship?

2. Is P1 not to be an active participant in the action named?

3. Would one generally expect the primary benefit of the action named to be to P2 in this
context? 67

4. Would one generally expect that P2 could successfully take responsibility for the action
named?
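
To make the role of the test groups concrete, the following sketch combines their outcomes into one of the four interpretations. It is a simplification under explicit assumptions, not a transcription of Figure 11.1: it assumes that the secondary forces are tried in the order OFFER, REQUEST, SUGGEST, that passing the ASK block means the utterance is a literal question only, and it leaves out the initial decision about whether P1 is the AGENT of the action named. The function name and the dictionary keys are hypothetical. The immediate force ASK is taken as given, since it falls out of the first-pass match of the entry pattern.

```python
from typing import Dict, List


def discriminate_do_you_want(tests: Dict[str, bool]) -> List[str]:
    """Combine test-group outcomes for a "Do you want <action>?" form.

    `tests` maps 'offer', 'request', 'suggest', and 'ask_block' to the
    outcome of the corresponding group of tests (missing keys mean "fail").
    """
    for secondary in ("offer", "request", "suggest"):  # assumed order
        if tests.get(secondary, False):
            if tests.get("ask_block", False):
                # Assumed reading: the literal question is all that is meant.
                return ["ASK"]
            return ["ASK", secondary.upper()]
    return ["ASK"]  # no secondary force applies
```

For example, `discriminate_do_you_want({"request": True})` yields `["ASK", "REQUEST"]`, as in the roommate example, while the same input with `"ask_block": True` yields `["ASK"]` alone.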



The set of questions and Figure 11.1 are intended for a limited set of interpretations of "Do
you want..." forms, but they are representative of other discrimination procedures. The tests
given include checks of the speech act preconditions (see Section 3), as well as checks of
semantic case fillers for the speech act or action named. Some of these, e.g., the third test for
SUGGEST, may be more restricted than the comparable precondition or semantic case
specification. The following criteria were used to decide which preconditions to exclude from
the tests:

1. Is this precondition shared by all of the speech act possibilities? 

Preconditions shared *y a»aheriwti¥es may be worth verifying, but not as pattof a 

discrimination procedure. 

2. Is this precondition being questioned or stated by the ISA form? 

This critej*** rules out RECAST (HI). SUGGEST (VI). awd OFFER (VI) for the 
"Do you want." example. 

3. Is the speech act usually the only evidence for this precondition? 



67 This condition seems to be limited to the case in which the utterance is acting as an 
initiator (see Section 9). Utterances -that tonvey a SUGGEST act but which occur in a 
context of stronger structural expectations seem to dispense with this condition and use the 
more 5 general SUGGEST'prtcondition (VII) instead, (F have suggested a basically ttjp-down 
(SA --> ISA) treatment of this latter type of utterance.) It is not surprising to see stronger 
restrictions on the range of initiators, since the context needed for disambiguation is weaker. 
An example is the obligation in SUGGEST (IV) that the DESTINATION of the 
SUGGEST consider the action suggested. Such obligations rest on very general 
properties of cooperative activity, for which there will not usually be good particular 
evidence. Similarly, there is frequently no evidence other than the speech act for 
intentional states of another individual. Preconditions of this type may be worth 
checking as a tie-breaker, but the chances of getting useful information are small 
enough that these preconditions should probably be excluded from the routine 
checking of a discrimination procedure. 

One general observation about the example procedure relates to the independence 
issues discussed in Section 10.4. Recall the point made there that immediate forces can 
usually be determined independently of secondary ones. This independence is reflected in 
Figure 11.1, where the emphasis is on determining what, if any, secondary force applies. In 
that case (although not in all) the fact that the immediate force was ASK fell out from the 
first-pass match of the entry pattern for "Do you want..." forms. 

A discrimination procedure such as the one described will not always produce a single 
answer. Lexical ambiguities within the utterance might lead to several different 
alternatives. In addition, particular tests can lead to multiple alternatives, especially those 
that involve rich contextual structures such as role relationships. For cases such as these 
where a discrimination procedure fails to sufficiently narrow the possible interpretations, the 
recognition mechanism can appeal to other sources of information. One useful source is the 
implications of particular ISA choices (e.g., with respect to politeness, confidence of speaker, 
formality of the environment, urgency of the speech act, etc.). Another is the raw likelihood 
that a given form is used to convey an individual speech act or set of speech acts. The 
primary value of these sources would appear to be in ruling out interpretations. 

The next issue that must be addressed is the generality of the discrimination 
procedure associated with the four types of "Do you want..." utterances. Can the 
discrimination procedure be used for other types of ISA forms? In general, I think that the 
answer is no. Although discrimination procedures for ISA forms will often appeal to similar 
sorts of information and use similar tests, they differ in the following ways: 

1. Deletions from and additions to the set of speech act possibilities can alter 
dependence relationships, which in turn affect the order in which tests are done. 

2. Some forms are more commonly used for a speech act than others. Even where the 
same speech act set is possible for two patterns, differences in raw likelihood might 
affect the order in which tests are performed. 

At this point it is not clear exactly how many discrimination procedures will have to be 
written. The number of procedures is related to the number of systematic ambiguities, and 
this number depends on the number of speech acts and the number of ISA forms known to 
a system. We are saved from the specter of infinitely proliferating discrimination 
procedures by the original intent of these procedures. Discrimination procedures are 
specially tailored to a specific systematic ambiguity. We need only write as many different 
discrimination procedures as there is specialized information. Where specialized 
information is lacking for an ambiguity, the entry pattern can have an associated list of 
possible speech act forces and a call to a general discrimination procedure. The core of this 
general procedure would be a check of all the preconditions of the possible speech act forces. 
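
The fallback arrangement just described can be pictured with a short Python sketch. It is 
an illustration only, under assumptions that are not part of the analysis: the names 
EntryPattern, general_discrimination, and discriminate are hypothetical, preconditions are 
reduced to boolean tests on a dictionary standing in for the dialogue context, and the two 
toy precondition lists are invented for the example. 

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional

# Assumption: a precondition is a predicate over a dictionary that stands in
# for the dialogue context. Real preconditions are far richer than this.
Precondition = Callable[[dict], bool]


@dataclass
class EntryPattern:
    """Hypothetical entry pattern with its list of possible speech act forces."""
    name: str
    possible_forces: List[str]
    # A specially tailored procedure, present only where specialized
    # information exists for this systematic ambiguity.
    special_procedure: Optional[Callable[[dict], List[str]]] = None


def general_discrimination(forces: List[str],
                           preconditions: Dict[str, List[Precondition]],
                           context: dict) -> List[str]:
    """Keep every force all of whose preconditions hold in the context."""
    return [f for f in forces
            if all(test(context) for test in preconditions.get(f, []))]


def discriminate(pattern: EntryPattern,
                 preconditions: Dict[str, List[Precondition]],
                 context: dict) -> List[str]:
    """Use the tailored procedure if there is one, else the general one."""
    if pattern.special_procedure is not None:
        return pattern.special_procedure(context)
    return general_discrimination(pattern.possible_forces, preconditions, context)


# Toy preconditions, invented purely for illustration.
PRECONDITIONS: Dict[str, List[Precondition]] = {
    "REQUEST": [lambda c: c.get("p2_able", False),
                lambda c: c.get("p1_wants_action", False)],
    "OFFER": [lambda c: c.get("p1_able", False),
              lambda c: c.get("p2_may_want_action", False)],
}

pattern = EntryPattern("Do you have <object>?", ["REQUEST", "OFFER"])
context = {"p2_able": True, "p1_wants_action": True, "p1_able": False}
result = discriminate(pattern, PRECONDITIONS, context)
```

In this toy context only REQUEST survives, since one OFFER precondition fails. The result 
is a list because, as noted earlier, a discrimination procedure will not always produce a 
single answer. 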

This has been a good deal of detail, and a summary may be helpful before going on. 
First, in the SA -> ISA direction the recognition mechanism starts with one or more speech 
act expectations. The task is to find or derive the appropriate ISA patterns, evaluate these 
with respect to the current dialogue context, and match the utterance against them. The 
method and other semantic linkages discussed in Part 1 provide a way to organize the ISA 
patterns and thus make the SA -> ISA operation relatively straightforward. 

In the ISA -> SA direction, the recognition mechanism starts with the ISA pattern, 
and its task is to identify one or more appropriate underlying speech acts. For this, I have 
suggested special discrimination procedures associated with ISA patterns (or abstractions of 
them). These discrimination procedures capitalize on the systematic ambiguities displayed 
by ISAs. 

11.2. Recognizing Frozen ISA Forms 

Recall that frozen forms were defined in Section 7.4 as follows. 

A frozen ISA form is an ISA form with an immediate force that does not correspond to 
the surface form. For such an utterance, one or both of the following obtains: 

1. It can or does display syntactic behavior atypical of the rest of the form. 

2. It can or does display cooccurrence behavior atypical of the rest of the form. 

To record this special behavior, frozen forms require separate patterns, distinct from 
patterns derived from general properties of speech acts and actions. Also, since we want to 
use different disambiguation schemes according to the direction of the match (a general 
disambiguation scheme for SA -> ISA and discrimination procedures for ISA -> SA), we 
need one set of frozen form patterns for each direction. While this may sound expensive, I 
have found it to be useful, because other information related to the path-building process 
can be associated with the different sets of patterns. (The "key" representations and "way" 
methods in ft] constitute such a split in patterns for frozen forms.) 

Besides having patterns distinct from the general ISA patterns, frozen forms will often 
have specially designed discrimination procedures. One reason for this is that the special 
surface behavior of some frozen forms will eliminate one or more possible interpretations 
altogether. Consider, for example, the familiar pair 5.3 and 5.4: 

5.3 Can you please close the door? 

5.4 *Are you able to please close the door? 

Without please, 5.4 can be either an ASK only or secondarily a REQUEST; the please in 5.3 
eliminates the possibility that it was intended as an ASK. 

Even forms that do not show special behavior themselves may be more prone to 
certain interpretations, justifying the use of separate discrimination procedures. For 
example, 11.1 is more idiomatic as a REQUEST than as an ASK. To convey an immediate 
ASK force, a form such as 11.2 is generally preferable. 

11.1 Will you shut the door? 

11.2 Are you going to shut the door? 

(Example 11.2 could also convey a secondary REQUEST force, but that is not my point 
here.) 

Frozen ISA forms, then, require separate patterns and, in at least some cases, separate 
discrimination procedures. They also require special treatment in the recognition matching 
algorithm. Because frozen forms are special cases, it is reasonable to try to match them first 
(in a serial algorithm) or to assign a higher priority to a successful frozen form match (in a 
parallel algorithm). This policy could result in some non-frozen utterances being treated as 
frozen ones, e.g., a "Can you <action>?" form intended as immediate ASK, secondary 
REQUEST. This simplification does not seem to me to be serious. Often, information will 
be too sparse for P2 to determine how seriously P1 intends a question such as the "Can you 
<action>?" form. Since such a wide range of responses to ISAs is possible (as noted in 
Section 5.2), we are not restricting the flow of the dialogue by treating all such forms as 
frozen. 
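
As an illustration of the serial policy, the following Python sketch tries a table of frozen 
patterns before a table of general ones. It is a minimal sketch under assumptions of its 
own: regular expressions stand in for real ISA patterns, and the two tables and the function 
name match_serial are hypothetical. 

```python
import re
from typing import List, Optional, Tuple

# Hypothetical pattern tables. Each entry pairs a regular expression with the
# immediate force it is taken to signal.
FROZEN_PATTERNS: List[Tuple[str, str]] = [
    (r"^can you (please )?(?P<action>.+)\?$", "REQUEST"),
]
GENERAL_PATTERNS: List[Tuple[str, str]] = [
    (r"^are you able to (?P<action>.+)\?$", "ASK"),
    (r"^can you (?P<action>.+)\?$", "ASK"),
]


def match_serial(utterance: str) -> Optional[Tuple[str, str, str]]:
    """Return (kind, immediate force, action) for the first matching pattern,
    trying every frozen pattern before any general one."""
    text = utterance.strip().lower()
    for kind, table in (("frozen", FROZEN_PATTERNS), ("general", GENERAL_PATTERNS)):
        for regex, force in table:
            m = re.match(regex, text)
            if m:
                return kind, force, m.group("action")
    return None


frozen_hit = match_serial("Can you please close the door?")
general_hit = match_serial("Are you able to close the door?")
```

Because the frozen table is consulted first, "Can you close the door?" is also returned as a 
frozen REQUEST even if it was meant as an immediate ASK. This is the simplification 
accepted in the paragraph above. 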

11.3. Recognizing Implicit-Action ISAs 

In this subsection, we consider the problem of integrating recognition of implicit 
actions with other aspects of the recognition of ISA forms. Discussion will concentrate on 
the "when" of implicit-action processing. Section 11.3.1 considers the treatment of ISAs 
without tertiary forces; the special problems associated with tertiary forces are discussed in 
Section 11.3.2. 

One important topic that is not considered here is the actual search process involved 
in identifying a particular implicit action. To a large extent, this process is dependent on 
general solutions to knowledge representation problems; it is not a process unique to ISAs. 
Nevertheless, issues of search strongly motivated the distinction among ISA types in Section 
7.5. Similarly, the tie-in between the rules in Sections 6 and 7 and the action representation 
in Section i was partly motivated by the search problem. Also relevant to this problem is 
the discussion of knowledge sources in Section 12; these sources are expected to guide and 
constrain the search. 
11.3.1. Integrating Implicit Actions into the ISA Mechanism 

As usual, the ISA recognition problem will be broken down according to the direction 
of match, and, as usual, the SA -> ISA direction is the easier one to describe. Recall that 
the implicit action rules given in Section 7.5 combined a pattern intended to match the 
surface semantic representation of the utterance with a pattern built from a general ISA rule 
(Section 6). That is, implicit-action rules can be thought of as beginning where explicit- 
action rules leave off. 

The straightforward approach to the SA --> ISA match for implicit-actions starts with 
the explicit-action patterns evaluated with respect to the current dialogue environment. 
Implicit-action rules are applied to these evaluated patterns. In the course of this process, 
some implicit-action rules will be found to be inapplicable, and the others will be evaluated. The 
utterance can then be matched against these evaluated implicit-action patterns. Both 
explicit- and implicit-action patterns, evaluated with respect to the current dialogue context, 
could be matched in whatever order or combination desired, allowing, of course, for the 
dependence of implicit-action forms on explicit-action ones. 

There are several possible variants to this approach to matching implicit-action 
patterns; the order of evaluation and rule application depends to a high degree on the 
general representation scheme chosen. Whatever the final choice, however, the important 
point about the SA --> ISA direction is that patterns evaluated with respect to the current 
dialogue environment can be quite specific. In many cases, a particular implicit action will 
already be represented in the evaluated pattern. For example, utterance 10.1 in the context 
discussed in Section 10 would match a SA --> ISA pattern that explicitly contained both a 
representation for "signature" and a representation for a signing action. 
10.1 I need your signature. 

Thus, for such examples there is no need to search for an implicit action. For other SA --> 
ISA examples, the decision about whether a particular action is intended and the search for 
the action can be done as part of the pattern match. 

Turning now to the ISA -> SA direction, we have the problem of integrating implicit 
actions into the discrimination procedure mechanism used for both general ISAs and frozen 
forms. The principal question is at what point in the process the choice of -- or search for 
-- the implicit action should take place. This question requires careful analysis because the 
process of identifying an implicit action appears to be an inherently expensive one. For 
most of the implicit-action types, the process is heavily dependent on a wide range of 
knowledge sources, where the exact source is not predictable beforehand. 

Identification of implicit actions can either be done at a fixed place in the bottom-up 
recognition process or at different places in the process depending on the speech act 
possibilities involved. The obvious candidate for a fixed place is the first matching pass. 
That is, the implicit action would be determined on entry to the discrimination procedure as 
part of matching the entry pattern. The evidence, however, points against identifying 
implicit actions during the initial match pass. First, the implicit-action rules as written 
would not be easily applicable to entry patterns. This is because implicit-action rules are 
specific to individual speech acts, and entry patterns are not. Even if some implicit-action 
rules were generalized, the link between implicit actions and individual speech acts goes 
deep. For example, 11.3 can be either an OFFER by P1 to give P2 a pen or a REQUEST 
that P2 give P1 a pen. 68 

11.3 Do you have a pen? 

The choice between these two alternatives can only be resolved within the discrimination 
procedure. There appears to be no advantage to generating both representations early and 
carrying them forward. 

A similar problem arises for forms where implicit-action ambiguities occur for a single 
speech act force. For example, consider the following form: 

11.4 Do you have <INSTRUMENT> for <action>? 

If the AGENT of the action is not specified, then there are two different REQUESTs 
among the possible interpretations. The first possibility is a REQUEST that P2 give P1 
the INSTRUMENT, and the second possibility is that P2 take responsibility for carrying 
out the action using the INSTRUMENT. 

Given the close relationship between implicit action choice and speech act force, then, 
it appears that the choice of implicit action should occur within the appropriate 
discrimination procedures, under explicit control of the system builder. 
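
A minimal Python sketch of this placement, using the "Do you have a pen?" form of 
example 11.3, is given here for illustration. The function name, the context keys, and the 
tests themselves are hypothetical stand-ins for real precondition checks. The only point 
illustrated is that each implicit action is constructed inside the branch for its own speech 
act force, not on entry to the procedure. 

```python
from typing import List, Optional, Tuple

# (secondary speech act force, implicit action). The immediate ASK force is
# taken as already established by the entry-pattern match.
Reading = Tuple[str, Optional[str]]


def discriminate_do_you_have(obj: str, context: dict) -> List[Reading]:
    """Hypothetical discrimination procedure for 'Do you have <object>?'."""
    readings: List[Reading] = []
    # REQUEST branch: only here is the action 'P2 gives P1 <object>' built.
    if context.get("p1_lacks_object", False):
        readings.append(("REQUEST", f"P2 gives P1 {obj}"))
    # OFFER branch: only here is the action 'P1 gives P2 <object>' built.
    if context.get("p1_has_object", False) and context.get("p2_lacks_object", False):
        readings.append(("OFFER", f"P1 gives P2 {obj}"))
    # If no secondary force is supported, the utterance is treated as ASK only.
    return readings or [("ASK", None)]


request_reading = discriminate_do_you_have("a pen", {"p1_lacks_object": True})
```

The boolean context keys are a deliberate oversimplification. In practice the tests would 
draw on the knowledge sources discussed in Section 12. 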

11.3.2. Recognizing Utterances with Tertiary Forces 

As was the case for other ISA classes, the fact that utterances can convey a tertiary 



68. For an OFFER, example 11.3 is derived by an application of an implicit-action rule 
analogous to Rule 7.4 to the result of the general ISA rule 6.1 Clause iv. For REQUEST, 
example 11.3 comes from the application of the implicit-action rule 7.2 to the result of the 
application of the general ISA rule 6.4 to Precondition (II). 
speech act force raises not only the problem of recognizing when such a force is intended, 
but also the problem of recognizing when such a force is not intended. Recall that among 
the ISA types considered here, type 3 implicit-action forms are the only ones that may 
systematically convey tertiary forces. 

5.18 Close the door. 

5.19 It's cold in here. 

5.20 Must I tell you that it's cold in here? 

Using our standard examples, recall that in an appropriate context 5.20 has three forces: an 
immediate force of ASK that corresponds to the surface form, a secondary force of a 
representative that conveys information like that in 5.19, and a tertiary force of REQUEST 
that conveys information like that in 5.18. 

In the SA -> ISA direction, tertiary speech act forces present no special problems. In 
addition to the normal match of ISA patterns outlined in the first subsection, the Interpreter 
can apply the appropriate rules a second time to type 3 forms to find the possible double 
indirection patterns. A match of one of the patterns so produced -- with accompanying 
identification of an implicit action -- indicates that a tertiary force applies. After a 
successful match of a triple-force utterance, the Interpreter builds three Event Tree paths, 
one for each force. 

Once again, the ISA -> SA direction is more difficult. The main problem presented 
by a double indirection in bottom-up processing is that a decision about whether the first 
possible indirection has been taken (e.g., ASK to the representative in 5.20 above) can be 
contingent on processing related to a second indirection (e.g., a representative to 
REQUEST). More specifically, the existence of an action appropriate to be an implicit 
action for a type 3 REQUEST form could be evidence not only for the second indirection, 
but also for the first. This means that we definitely cannot think of the bottom-up 
recognition process for double indirections as two subprocesses acting independently of each 
other. The decision about whether an utterance such as 5.20 is intended as a direct ASK, 
an ASK plus an indirect representative, or an ASK plus a representative plus a REQUEST 
may be computationally messy because processing cannot be subdivided in all cases. 

Although double indirection clearly poses special problems, I do not want to give the 
impression that the situation is hopeless. Note first that the analysis phase described in the 
last subsection uses general procedures. The problem, then, is not finding a way to handle 
double indirection within this framework but, rather, finding a good way. It appears that 
the answer to most of our problems can be found in the data itself. In the transcripts that I 
have examined, the proportion of type 3 ISAs to other forms is small; moreover, of the type 
3 forms, only a subset are based on a double indirection. More important for bottom-up 
processing, only a handful of different forms need to be considered as possible double 
indirections (whether or not the utterances actually are). 

Beyond simple numbers, note that the literal interpretation of double indirections 
appears to require strong contextual support. Consider, for example, 11.5 and 11.6: 

11.5 Do you know that it's cold in here? 

11.6 I want you to know that it's cold in here. 

Without a specialized context (e.g., a psychological experiment) it is difficult to imagine 11.5 
or 11.6 uttered as an initiator with only the literal interpretation intended. 
The need for special contextual support is the most important property of the data for 
bottom-up processing, because it means that outside of such contexts the first indirection can 
be assumed with relative safety. The task then becomes to determine whether or not a 
tertiary force applies, e.g., whether 11.5 is an ASK and a STATE or an ASK, a STATE, and 
a REQUEST. This task is similar to the processing of the possible single indirection "It's 
cold in here." So, while in theory double indirections are more difficult computationally, it 
appears that in practice only rare -- and recognizable -- cases require more processing 
power than a comparable single indirection. 
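
The resulting bottom-up shortcut can be sketched in a few lines of Python. This is a 
hedged illustration, not a worked-out mechanism: the function name and its two arguments 
are hypothetical, and a specialized context is simplified here to license only the literal 
reading, which is stronger than what the discussion above claims. 

```python
from typing import List, Optional


def forces_for_possible_double_indirection(special_context: bool,
                                           implicit_action: Optional[str]) -> List[str]:
    """Forces assigned to a form such as 'Do you know that it's cold in here?'."""
    if special_context:
        # Simplification: in a specialized context (e.g., a psychological
        # experiment) only the literal ASK reading is kept.
        return ["ASK"]
    # Outside such contexts the first indirection is assumed.
    forces = ["ASK", "STATE"]
    # What remains is the single-indirection question: was an implicit
    # action identified that would support a tertiary REQUEST?
    if implicit_action is not None:
        forces.append("REQUEST")
    return forces


triple = forces_for_possible_double_indirection(False, "close the door")
```

How the implicit action is found is deliberately left out. That search is the same one 
needed for the single indirection "It's cold in here." 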
12. Knowledge Sources for ISA Recognition 

Throughout this paper I have emphasized the importance of context in the 
interpretation of ISAs. While a complete account of the role of context in ISA recognition is 
not possible at this point, we can begin to identify the knowledge sources involved. This 
section examines the following six types of knowledge: 

1. domains of discourse 

2. complementary role relationships 

3. special properties of P1 

4. special properties of P2 

5. exceptional circumstances 

6. the course of the interaction to date 

These six knowledge sources play an important role in the interpretation of implicit-action 
ISA forms, e.g., in identifying the intent of "It's cold in here" uttered as a REQUEST for P2 
to shut the door. Although implicit-action forms will be used as examples in this section, 
this is not the only place that the six knowledge sources can be used. The sources also play 
a role in supplying semantic case information that is left implicit in explicit-action forms. 
For example, the INSTRUMENT is left unspecified in example 12.1, but, if P1 uttered 12.1 
while handing P2 a bread knife, then the case assignment would be clear. 

12.1 Will you cut the bread? 

Finally, the six knowledge sources can supply information needed to disambiguate forces of 
an ISA. for example the ppsstf?^ rfa^i^Sjef *^ 
Two additional observations must be made about the list above. First, the six types 
of knowledge are not necessarily seen as independent. We can expect interaction between 
sources, especially between any of (2) through (6) and any of the other sources on the list. 
Second, recall that I am assuming for the sake of exposition that P1 and P2 share an 
identical world model with respect to this knowledge. This assumption will not be correct in 
all cases. Sometimes ISAs uttered by P1 will be interpreted by P2 in terms of P2's view of 
P1's view of the six knowledge sources. 

12.1. Domains of Discourse 

In this section, domain is meant to include 

1. the spatial environment: locations such as home, office 

2. the social environment: e.g., public transportation, parties, school lectures 
3 the general task environment: ;e.g. ea«»ga^nelViWMiinf abartiieM 69 

In talking about the relation of domains to the interpretation of ISAs, I wish to make a case 
for the existence of routine actions associated with particular domains. By routine I wish to 
emphasize the concepts that appear in dictionary definitions, such as prescribed, regular, 
customary, habitual. That is, routine actions associated with particular states in particular 
domains are not actions that are arrived at by intricate creative or planning processes. 
They are rather humdrum actions whose indications for use are common knowledge. 



69. I specify general in the task environment because specific tasks will be discussed 
separately in Section 12.6. General task environments are activities that include a collection 
of independent tasks, i.e., tasks that are not subtasks of each other. 
To illustrate, I return to "It's cold in here" used as an indirect REQUEST for a 
particular action. In an average home, the routine cold-weather responses to this utterance 
might include: 

1. check if windows or doors are open or are admitting drafts 

2. check to see if more sun can be let in 

3. turn up heat 

4. add clothes, blankets, etc. to anyone who is cold 

The strict ordering of the alternative actions is not obligatory in all cases, but it is 
appropriate in some. Note that such routine actions can be associated with very specific 
domains. Another household might use the same responses but order them differently. A 
third household might include throwing a log on the fire as one of the actions. For this 
example, the domain may be not only a particular home but even a particular room. To 
represent the routine actions of a domain, we can group them by the goal(s) that they 
achieve and write choice methods for each cluster. These choice methods will contain the 
known tradeoffs between the actions, as well as any ordering that is appropriate. 
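
One way to picture this representation is the following Python sketch, offered as an 
illustration under assumed names. The table ROUTINE_ACTIONS, the goal label "get warmer", 
and the function choose_actions are all hypothetical, and the choice method shown is the 
simplest possible one: filter the cluster by what is currently possible while keeping the 
household's ordering. 

```python
from typing import Callable, Dict, List

# Routine actions for one domain, grouped by the goal they achieve. The order
# within a cluster is the a priori ordering for this particular household.
ROUTINE_ACTIONS: Dict[str, Dict[str, List[str]]] = {
    "average home": {
        "get warmer": [
            "check windows and doors",
            "let in more sun",
            "turn up heat",
            "add clothes or blankets",
        ],
    },
}


def choose_actions(domain: str, goal: str,
                   is_possible: Callable[[str], bool]) -> List[str]:
    """A very simple choice method: keep the routine actions that are possible
    in the current situation, preserving the domain's ordering."""
    candidates = ROUTINE_ACTIONS.get(domain, {}).get(goal, [])
    return [action for action in candidates if is_possible(action)]


# Example situation (invented): at night, letting in more sun is not possible.
def possible_at_night(action: str) -> bool:
    return action != "let in more sun"


ranked = choose_actions("average home", "get warmer", possible_at_night)
```

A fuller choice method would also encode the known tradeoffs between actions. A different 
household would simply supply a different table, for instance one that includes throwing a 
log on the fire. 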

Returning now to speech acts, one way to communicate an implicit action (or other 
implicit information) in an ISA is to draw on the routine actions associated with the 
particular domain. If a particular action is intended by P1, it must be distinguishable from 
among this set of possibilities. Otherwise, the ISA is not successful. Where the alternative 
actions have a strong a priori ordering, we can filter the alternatives according to what is 
possible in the domain and add criteria provided by the other knowledge sources discussed 
in this section. Where there is no particular order of choice, either other knowledge sources 
should be applicable or other routine actions should be inapplicable in the environment. 
Assuming that an action was intended, if the implicit action is still ambiguous, then either 
P2 has too rich a set of possibilities or P1 has been vague. 

12.2. Complementary Roles of P1 and P2 

Another source of information for clarifying ISAs is the role, i.e., a set of actions 
which may be associated with a dialogue participant. In this subsection, I am considering 
only complementary role relationships, e.g., doctor/patient or conductor/passenger. Roles of 
dialogue participants that are not complementary can also affect the interpretation of ISAs; 
they are included under properties of P1 and P2 (Sections 12.3 and 12.4). Like domains, 
roles exhibit different levels of aggregation. On an assembly line, a role might be a 
relatively small set of actions or even a single one, such as tightening a particular bolt. The 
role for, say, a homemaker could contain many more actions, possibly grouped into related 
sets such as cooking, house cleaning, home repair, and so forth. Here, homemaker is what I 
will call a role name; the word role will be reserved for the action set. 

Roles are particularly important sources of information about ISAs in task-oriented 
environments, where formal job descriptions often exist and where labor is often explicitly 
divided. They also come into play in more informal environments, where tasks and 
responsibilities are divided by custom, by social role, or by agreement. 

An example of an implicit-action ISA based on a complementary role relationship is 
12.2 when addressed to P2, an auto mechanic (a role name) at P2's place of business (a 
domain). 

12.2 The clutch on my van is gone. 
Under the conditions specified, 12.2 can count as a REQUEST that the mechanic fix or 
replace the auto part, in this case the clutch. This is one of the actions in an auto 
mechanic's role. Utterance 12.2 can also have the force of a simple STATE, for example if 
it is directed to a friend encountered while waiting in line. For this example in this 
environment, it takes the mechanic's role to make an appropriate action, and hence a 
REQUEST sense, clear. To ask the friend met in line for a ride, one would generally need 
a more specific REQUEST form in addition to 12.2. 

One question that must be asked about roles is what effect multiple roles have on the 
interpretation of ISAs. Participants in a dialogue often share more than one complementary 
relationship. While domain introduces some possible roles (e.g., host or hostess at a party), 
other roles arise strictly from the relationship of P1 and P2. These multiple relationships do 
not seem to confuse the interpretation of ISAs as much as one might suspect. Consider an 
example in which P2 is both P1's doctor and P1's friend. If P1 describes a pain to P2 at a 
party, it is probable that P1 is REQUESTing more than sympathy; very likely the appeal is 
to P2's special expertise and the implicit action is a diagnosis. Where more than one 
complementary role relationship exists, the default seems to be to appeal to the role with the 
most powerful action in the area of the REQUEST. 70 This is not to say, however, that 
confusion cannot occur, and in this case, a clarifying question can be asked. 



70. This is subject, of course, to constraints. Example 12.2 uttered to a mechanic at a party 
would seldom be seen as a REQUEST to replace a clutch right then and there. This 
reading could be ruled out by the absence of the necessary tools and parts, by the party 
domain (because it conflicts with standard party activities), or by some other condition. 
Note that the first two conditions named correspond to the second blocking question that 
leads to an ASK interpretation in the example worked out in Figure 11.1. 
12.3. Properties of P1 

While complementary roles relate to the mutual relationship of P1 and P2, properties 
of P1 independent of the relationship can also affect the interpretation of an ISA. One 
important case mentioned above is the non-mutual role, such as an appeal to the medical 
expertise of P2 where P1 is not a patient. Another important class of properties includes the 
likes and dislikes of P1. Consider, for example, 12.3 uttered as a REQUEST: 

12.3 I have to be in Cleveland by £00 tomorrow morning. 

Let us assume that 12.3 is uttered in an office (a domain) to a secretary (a role name). Let us 
assume further that the routine response to such a REQUEST in this office is to make 
travel arrangements, in particular to book a flight if the distance is appropriate. Consider 
the case, however, in which P1 is afraid to fly and has made P2 aware of this fact. In this 
case, some other routine action may be understood, such as making train reservations. This 
default would be associated with P1. 

In general, properties of P1 may block alternative actions suggested by other 
knowledge sources (and hence perform a disambiguation function), they may suggest totally 
new alternatives, or they may cause a reordering of existing alternatives. 
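
These three effects (blocking, suggesting, and reordering) can be illustrated with a small 
Python sketch based on the travel example. The class ParticipantProperties, the function 
apply_properties, and the alternative "drive" are hypothetical names introduced only for 
this illustration. 

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class ParticipantProperties:
    """Hypothetical record of how one participant's properties bear on actions."""
    blocked: List[str] = field(default_factory=list)    # actions ruled out
    suggested: List[str] = field(default_factory=list)  # totally new alternatives
    preferred: List[str] = field(default_factory=list)  # actions moved to the front


def apply_properties(alternatives: List[str],
                     props: ParticipantProperties) -> List[str]:
    """Block, extend, and reorder the alternatives from other knowledge sources."""
    kept = [a for a in alternatives if a not in props.blocked]
    for a in props.suggested:
        if a not in kept and a not in props.blocked:
            kept.append(a)
    # Stable sort: preferred actions first, everything else in its prior order.
    return sorted(kept, key=lambda a: a not in props.preferred)


office_routine = ["book a flight", "make train reservations"]
afraid_to_fly = ParticipantProperties(blocked=["book a flight"])
understood = apply_properties(office_routine, afraid_to_fly)
```

With the fear of flying recorded as a blocked action, only the train reservation remains, 
which matches the default described for 12.3. 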

12.4. Properties of P2 

Properties of P2 that are common knowledge to P1 and P2 can also have an effect on 
the interpretation of an ISA. 

12.4 Tabby needs a new home. 

If P2 is allergic to cats, P1 knows it, and P2 knows that P1 knows, then 12.4 will not 
(assuming the good will and good memory of P1) be a REQUEST that P2 adopt Tabby. 

In general, properties of P2 are of a type and behavior similar to properties of P1. 
There is, however, an added dimension. In deciding whether an interpretation is affected 
by properties of P2, P2 must determine both whether P1 knows about the property and 
whether P1 seems to be aware of it at the time of the REQUEST. To make these decisions, 
P2 will usually be operating with highly incomplete knowledge. If there is any question of 
the right choice and if anything significant rides on the choice, P2 will often state the 
property (e.g., 12.5 in response to 12.4) as a verification of the choice. 

12.5 If I weren't allergic to cats, I'd be happy to take Tabby. 

12.5. Exception Conditions 

The last four subsections have enumerated sources of knowledge, with the stated 
assumption that standard conditions prevailed. Interpretations of ISAs can be affected 
dramatically, however, by the existence of exception conditions. Exception conditions are 
not a totally separate source of knowledge but are integrated into all four categories 
discussed so far: domain, role, and the special properties of P1 and P2. 

An example of an appeal to a domain-related exception condition is 12.6 uttered at 
home in a power failure. 

12.6 It's dark in here. 

If P1 is aware of the power failure, then it is very unlikely that 12.6 is a REQUEST that P2 
flip the light switch. Due to the exception condition, in this particular domain getting a 
flashlight might be the preferred action, even the one intended by P1. 

Exception conditions are not limited to domains. An exception condition in a role 
might arise when a piece of machinery is broken and a different manufacturing procedure 
must be used. Finally, for P1 and P2, an exception condition might be a broken leg. 

Exception conditions not only block or reorder standard alternatives; they may also 
bring totally new sets of alternatives of their own. For example, the use of a flashlight, 
while a solution to darkness in a standard situation, is not a routine solution in most 
domains. Note that an implicit action, or other aspect of an ISA, can only be "clear from 
context" in an exception situation when the exception condition has its own routine actions 
associated with it. We are still talking, then, about noncreative, routine actions. The 
difference is that these actions are associated with non-routine conditions. 
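
The power-failure example can be sketched in Python as follows. This is a minimal 
illustration under assumptions: the tables ROUTINE and EXCEPTIONS, the goal label 
"get light", the action "open the curtains", and the function candidate_actions are all 
invented here. An exception condition is modeled as blocking some standard actions and 
contributing routine actions of its own. 

```python
from typing import Dict, Iterable, List, Tuple

# Standard routine actions, keyed by (domain, goal).
ROUTINE: Dict[Tuple[str, str], List[str]] = {
    ("home", "get light"): ["flip the light switch", "open the curtains"],
}

# Each exception condition blocks some standard actions and brings routine
# actions of its own, keyed by goal.
EXCEPTIONS: Dict[str, dict] = {
    "power failure": {
        "blocks": {"flip the light switch"},
        "adds": {"get light": ["get a flashlight"]},
    },
}


def candidate_actions(domain: str, goal: str,
                      active_exceptions: Iterable[str] = ()) -> List[str]:
    """Routine actions for a goal, adjusted for any active exception conditions."""
    actions = list(ROUTINE.get((domain, goal), []))
    for name in active_exceptions:
        exc = EXCEPTIONS[name]
        # Block standard alternatives that the exception rules out.
        actions = [a for a in actions if a not in exc["blocks"]]
        # The exception's own routine actions are placed ahead of the rest.
        actions = exc["adds"].get(goal, []) + actions
    return actions


standard = candidate_actions("home", "get light")
in_power_failure = candidate_actions("home", "get light", ["power failure"])
```

Placing the exception's actions first is one possible ordering policy, chosen here only to 
reflect the remark that getting a flashlight might be the preferred action. 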

12.6. The Course of the Interaction to Date 

This brings us to the last category, the effect of the interaction -- both linguistic and 
non-linguistic -- between P1 and P2. Several different things may occur here. First, 
information may be gained in the course of the dialogue that directly pertains to the 
categories already discussed. P1 may explicitly specify the domain, e.g., "Let's view this 
problem geometrically." One of P1's roles may also be specified explicitly, e.g., "Speaking as 
the owner of this property, I want you to know that you're trespassing." Similarly, P1- or 
P2-based properties may be discussed explicitly in the dialogue, or an exception condition 
may be noted. 
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Interpretations of ISAs may also be affected by specific tasks that are now or were 
previously underway in the course of the current interaction, as well as the goal structures of 
PI and P2 revealed in the dialogue ^ An example of the effect of specific task is the 
interpretation of example 127. 

12.7 We have twenty cartons of paper clips. 

If the domain is an office and the specific task is taking inventory, then 12.7 can be 
interpreted as a simple representative. If the task is rearranging the supplies, then 12.7 
lends itself naturally to interpretation as a REQUEST for advice on where to put things or 
for help in moving them. An example of the effect of discussion about goals is 12.8, which 
can be seen as part of an extended exchange related to planning the day. 

12.8 

12.8.1 P1: I think I'll go to the A&P today. 

12.8.2 P2: We're out of flour. 

It is reasonable to construe P2's utterance in 12.8 as a REQUEST that P1 buy flour on the 
planned shopping trip. To see that this interpretation of 12.8.2 is dependent on 12.8.1, 
compare the following: 

12.9 

12.9.1 P1: Look at the snow. Let's bake a cake. 

12.9.2 P2: We're out of flour. 

The most obvious interpretation of 12.9.2 is as a representative acting as a rejection of the 
suggestion in 12.9.1. 
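
The dependence of interpretation on the current task or on the preceding utterance can be made concrete with a deliberately tiny lookup sketch. It is not a recognition procedure: the table INTERPRETATIONS, the context labels, and the function interpret are hypothetical names that simply record the readings discussed for 12.7, 12.8, and 12.9, with a plain representative assumed as the fallback.

```python
# Hypothetical table pairing (utterance, context) with the reading discussed above.
INTERPRETATIONS = {
    ("We have twenty cartons of paper clips.", "taking inventory"):
        "representative",
    ("We have twenty cartons of paper clips.", "rearranging the supplies"):
        "REQUEST for advice or help",
    ("We're out of flour.", "P1 plans a shopping trip"):
        "REQUEST that P1 buy flour",
    ("We're out of flour.", "P1 suggests baking a cake"):
        "representative rejecting the suggestion",
}


def interpret(utterance, context, default="representative"):
    """Look up the reading of an utterance in a given context.

    Assumption: when nothing in the table applies, the utterance is
    taken at face value as a representative.
    """
    return INTERPRETATIONS.get((utterance, context), default)
```

The same surface form maps to different speech acts purely because the second key component changes, which is the point the paired examples make.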

71. I am assuming that goals and tasks from previous dialogues and other shared 
experiences would already be assimilated and would be accessible as properties of the 
participants. 

Another way that the interaction to date may affect the interpretation of ISAs is in 
discussion of the preconditions of actions. Consider, for example, 12.10. 

12.10 

12.10.1 P1: Do you play dominoes? 

12.10.2 P1: I need a partner for the mixed doubles tournament on Friday. 

The presence of 12.10.1 clarifies the interpretation of the particular action in 12.10.2: P1 wants 
P2 to agree to be P1's partner in the tournament. Without this preliminary remark, the 
particular action would be considerably less clear. Utterance 12.10.2 might just as well be a 
REQUEST for advice or for sympathy. 

Although I am not attempting to describe how the dialogue affects the interpretation 
of ISAs, a case could be made for viewing the examples in this subsection as derivable from 
methods (Section 4) and the Event Tree and recognition framework (Section 9). This is a 
natural approach, because the Event Tree is one record of the linguistic and non-linguistic 
interaction between P1 and P2. Moreover, methods can provide strong connections between 
utterances and strong expectations about conditions in the environment. We can therefore 
expect methods and the Event Tree to be useful in modelling many (although not all) of the 
interactions between P1 and P2 that can have an effect on ISA interpretation. 



For P1 to construct a well-formed indirect speech act, there must be a reasonable 
chance that P2 will interpret the utterance as intended. In general, it is only by an appeal 
to shared knowledge that a correct interpretation becomes possible. The six knowledge 
sources discussed in this section give us a start toward a catalogue of the information 
common to dialogue participants (or, more precisely, the information that one reasonably 
expects will be common). The knowledge can come out of shared experience, out of 
communication between participants, or from the standards individuals and institutions 
impose to make certain actions and knowledge "routine". 

13. In Conclusion 

In this paper I have presented an account of a large number of ISA forms within 
task-oriented dialogue. While the account was strongly motivated by computational 
considerations, non-computational arguments were used to justify many of the distinctions 
made. The major characteristics of the approach that I have taken are as follows: 

- Starting from the observation of traditional speech act theory that language is 
action, I have advocated a uniform representational scheme for speech acts and 
non-linguistic actions. The foundation of the representation is the OWL-I method. 

- Speech acts and other actions were combined into larger patterns of action, also 
represented by OWL-I methods. These larger patterns were used as a partial model of 
the structure of dialogue. This gave a framework in which to discuss issues of ISA 
recognition. 

- The central processing assumption has been that the phenomenon of indirect 
speech acts is too complex to admit to a single, uniform computational treatment. It is 
necessary, then, to identify classes of indirect speech acts that share similar 
computational properties and use different representations and processing strategies 
for the different classes. 

- I have claimed that a process model of ISA usage needs at least the following: 

1. rules based on general properties of speech acts and other actions 

2. patterns that represent special syntactic and cooccurrence behavior for 
frozen ISA forms 

3. rules that embody relationships that can be used in referring to implicit 
actions 

There is much more that needs to be done before a full process model of ISAs is 
achieved. First, there are the areas that were explicitly excluded, among them hedged 
performatives and "second order" ISAs, implications of ISA choices, and the generation of 
ISA forms. Second, more in-depth analysis is needed for individual speech acts. Some of 
these that are particularly interesting are the speech acts for stating facts, beliefs, and 
opinions. Despite a great deal of attention from philosophers and some linguists, there is 
still much that can be done in this area. Third, notions of context need to be more fully 
worked out. A fourth major area is reasoning for disambiguation; we are far from a 
complete understanding of how to block competing, but incorrect, interpretations of ISAs. 

Finally, I would like to comment on the implications of all of this for human-machine 
dialogue. Judging from the open problems I have listed, one might expect that computer 
systems that reliably recognize ISAs would be many decades away. I think, however, that a 
partial solution to the problem is much more accessible to us. Using the speech act 
categorization developed here, it is possible to identify useful subsets of ISAs. This already 
is an advantage, since it is often very difficult to identify useful subsets of English; 
frequently implementers find themselves in what amounts to an "all or nothing" situation. 
Moreover, given the categorization, it will be easy to describe what a system actually does 
for ISAs. Since the categories were justified to a large extent by surface English behavior, 
they should appear natural to speakers of English. Again, this is an area in which 
implementers have had only mixed success. 

Another way in which the implementer's job can be greatly simplified is by carefully 
restricting the speech acts accepted and the tasks available for discussion. (Both restrictions, 
of course, must be made clear to the user.) Limiting the types of speech act accepted means 
that some secondary forces are simply not possible in the environment. With only a few 
tasks handled, it is not as difficult to identify an implicit action; fewer tasks mean that the 
task intended will usually be "clear from context." 

Thus, while many problems remain to be solved, I feel that not only is a 
computational theory of ISAs well on its way, but we are at the point where ISA processing 
can be done cleanly, if not yet completely, in computer dialogue systems. 






References 



[1] Allen, J. A plan based approach to speech act recognition, Technical Report No. 131/79 
(February 1979), Department of Computer Science, University of Toronto, Toronto, 
Canada. 

[2] Austin, J.L. How to Do Things with Words, Oxford University Press, New York, 1962. 

[3] Brown, G.P. A framework for processing dialogue, LCS TR-182 (June 1977), 
Laboratory for Computer Science, M.I.T., Cambridge, MA. 

[4] Brown, G.P. Linguistic and situational context in a model of task-oriented dialogue, to 
appear in L. Vanya and K.J.J. Hintikka (eds.): Models of Dialogue, D. Reidel, 
Dordrecht, Holland. 

[5] Bruce, B.C. Belief systems and language understanding, BBN Report No. 2973 
(January 1975), Bolt, Beranek and Newman, Inc., Cambridge, MA. 

[6] Cohen, P.R. On knowing what to say: planning speech acts, Technical Report No. 118 
(January 1978), Department of Computer Science, University of Toronto, Toronto, 
Canada. 

[7] Davison, A. Indirect speech acts and what to do with them, in: Cole and Morgan 
(Eds.) Syntax and Semantics, vol. 3, Academic Press, New York, 1975. 

[8] Deutsch, B.C. The structure of task oriented dialogs, Proceedings IEEE Speech 
Symposium, Carnegie-Mellon University, Pittsburgh, PA (April 1974). 

[9] Fraser, B. Hedged performatives, in: Cole and Morgan (Eds.) Syntax and Semantics, 
vol. 3, Academic Press, New York, 1975. 

[10] Gordon, D. and Lakoff, G. Conversational postulates, in: Cole and Morgan (Eds.) 
Syntax and Semantics, vol. 3, Academic Press, New York, 1975. 

[11] Grice, H.P. Logic and conversation, in: Cole and Morgan (Eds.) Syntax and 
Semantics, vol. 3, Academic Press, New York, 1975. 

[12] Grosz, B. The representation and use of focus in a system for understanding dialogs, 
Proceedings of the Fifth International Joint Conference on Artificial Intelligence, 
Cambridge, MA (August 1977). 

[13] Hawkinson, L. The representation of concepts in OWL, Advance Papers of the Fourth 
International Joint Conference on Artificial Intelligence, Tbilisi, Georgia, USSR 
(September 1975). 






[14] Labov, W. and Fanshel, D. Therapeutic Discourse, Academic Press, New York, 1977. 

[15] Lakoff, R. The logic of politeness, or, Minding your p's and q's, Papers from the ninth 
regional meeting of the Chicago Linguistic Society, University of Chicago Department 
of Linguistics, Chicago (1973). 

[16] Levin, J. and Moore, J. Dialogue games: Meta-communication structures for natural 
language interaction, Cognitive Science Vol. 1, No. 4, October 1977. 

[17] Long, W.J. A program writer, LCS TR-187 (November 1977), Laboratory for Computer 
Science, MIT, Cambridge, MA. 

[18] Martin, W.A. OWL: A system for building expert problem solving systems involving 
verbal reasoning, 6.871 Course Notes, unpublished, MIT, Cambridge, MA (1974). 

[19] Minsky, M. A framework for representing knowledge, in: Winston (Ed.), Visual 
Information Processing, MIT Press, Cambridge, MA, 1975. 

[20] Moore, J.A., Levin, J.A., and Mann, W.C. A goal-oriented model of human dialogue, 
American Journal of Computational Linguistics, Microfiche 67 (1977). 

[21] Morgan, J.L. Two types of convention in indirect speech acts, in: Cole (Ed.) Syntax 
and Semantics, vol. 9, Academic Press, New York, 1978. 

[22] Perrault, C.R., Allen, J. and Cohen, P. Speech acts as a basis for understanding 
dialogue coherence, Theoretical Issues in Natural Language Processing-2, University 
of Illinois at Urbana-Champaign, Urbana, IL (July 1978). 

[23] Rubin, A. A framework for comparing language experiences, Theoretical Issues in 
Natural Language Processing-2, University of Illinois at Urbana-Champaign, 
Urbana, IL (July 1978). 

[24] Sadock, J.M. Toward a Linguistic Theory of Speech Acts, Academic Press, New York, 
1974. 

[25] Sadock, J.M. On testing for conversational implicature, in: Cole (Ed.) Syntax and 
Semantics, vol. 9, Academic Press, New York, 1978. 

[26] Sag, I. and Liberman, M. The intonational disambiguation of indirect speech acts, 
Papers from the ninth regional meeting of the Chicago Linguistic Society, University of 
Chicago Department of Linguistics, Chicago (1975). 

[27] Schank, R. and Abelson, R. Scripts, Plans, Goals, and Understanding, Lawrence 
Erlbaum Associates, Hillsdale, NJ, 1977. 






[28] Schegloff, E. and Sacks, H. Opening up closings, in: Turner (Ed.) Ethnomethodology, 
Penguin Books, Middlesex, England, 1974. 

[29] Searle, J.R. Speech Acts, University Press, Cambridge, 1969. 

[30] Searle, J.R. Indirect speech acts, in: Cole and Morgan (Eds.) Syntax and Semantics, 
vol. 3, Academic Press, New York, 1975. 

[31] Searle, J.R. A taxonomy of illocutionary acts, in: Gunderson (Ed.), Language, Mind, 
and Knowledge, University of Minnesota Press, 1976. 

[32] Swartout, W.R. A digitalis therapy advisor with explanations, Proceedings of the Fifth 
International Joint Conference on Artificial Intelligence, Cambridge, MA (August 
1977). 

[33] Szolovits, P., Hawkinson, L., and Martin, W.A. An overview of OWL, a language for 
knowledge representation, Proceedings of the Workshop on Natural Language for 
Interaction with Data Bases, International Institute for Applied Systems Analysis, 
Laxenburg, Austria. 

[34] Verschueren, J. Speech act theory: A provisional bibliography with a terminological 
guide, (May 1976) Indiana University Linguistics Club, Bloomington, IN. 

[35] Verschueren, J. The analysis of speech act verbs: Theoretical preliminaries, (August 
1977) Indiana University Linguistics Club, Bloomington, IN. 

[36] Zwicky, A.M. and Sadock, J.M. Ambiguity tests and how to fail them, in: Kimball 
(Ed.) Syntax and Semantics, vol. 4, Academic Press, New York, 1975. 






Appendix A. Core Dialogue Methods 

This appendix contains a list of some of the core dialogue methods associated with 
the speech acts from Section 3. Along with the list of core methods is a description of the 
OBJECT case specification as well as a description of the major standard path steps. The 
actual representation of these methods is done in OWL-I. 



ASK-AND-ANSWER 
OBJECT: a what-, where-, who-, whether-, or when-question to be answered 
MAJOR STANDARD PATH STEPS: 
    P1 ASKs the question. 
    P2 evaluates the query to decide whether to comply. 
    P2 finds the answer. 
    P2 gives the answer. 

ASK-AND-DESCRIBE 
OBJECT: a thing or how-question 72 
MAJOR STANDARD PATH STEPS: 
    P1 ASKs what the description is. 
    P2 decides whether to comply. 
    P2 gives the description. 

ASK-AND-EXPLAIN 
OBJECT: an action, a how-question, or a why-question 
MAJOR STANDARD PATH STEPS: 
    P1 ASKs what the explanation is. 
    P2 decides whether to comply. 
    P2 gives the explanation. 

72. How-questions are split between ask-and-describe and ask-and-explain depending on the 
type of information that seems appropriate. Of course, ask-and-describe and ask-and- 
explain can also be triggered by a direct request for a description or explanation, 
respectively. The motivation for distinguishing ask-and-describe and ask-and-explain from 
ask-and-answer is that the first two will tend to be involved with longer answers that 
require more selection and organization of the information. Ask-and-describe and ask-and- 
explain are distinguished from each other by the aspects of the topic that are considered 
relevant in answering a how-question; for ask-and-explain, the emphasis is on causal 
relationships. 

REQUEST-AND-HELP 
OBJECT: an action that can be the object of REQUEST 
    exception: a repetitive action 
MAJOR STANDARD PATH STEPS: 
    P1 REQUESTS P2 to take responsibility for helping P1 with an action. 
    P2 decides whether to comply. 
    in either order: 
        P2 ACKNOWLEDGES the REQUEST. 
        If not already understood, roles in the action are assigned 
        to the participants. 
    then: 
        The action is carried out. 



SUGGEST-AND-ACCEPT 73 
OBJECT: an action 
MAJOR STANDARD PATH STEPS: 
    P1 SUGGESTS the action. 
    P2 decides whether to comply. 
    P2 ACCEPTS the suggestion. 
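
To show how listings of this kind might be held in a program, here is a minimal sketch. It does not reproduce the OWL-I representation, which is not given in this appendix; the class DialogueMethod and the function expected_next_step are hypothetical names, and treating the standard path as a flat sequence is a simplifying assumption (REQUEST-AND-HELP, with its "in either order" steps, would need something richer).

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class DialogueMethod:
    """A core dialogue method: name, OBJECT case specification, and standard path."""
    name: str
    object_spec: str
    steps: Tuple[str, ...]


ASK_AND_ANSWER = DialogueMethod(
    name="ASK-AND-ANSWER",
    object_spec="a what-, where-, who-, whether-, or when-question to be answered",
    steps=(
        "P1 ASKs the question.",
        "P2 evaluates the query to decide whether to comply.",
        "P2 finds the answer.",
        "P2 gives the answer.",
    ),
)

SUGGEST_AND_ACCEPT = DialogueMethod(
    name="SUGGEST-AND-ACCEPT",
    object_spec="an action",
    steps=(
        "P1 SUGGESTS the action.",
        "P2 decides whether to comply.",
        "P2 ACCEPTS the suggestion.",
    ),
)


def expected_next_step(method: DialogueMethod, completed: int) -> Optional[str]:
    """Return the step expected after `completed` steps, or None when the path is done."""
    if 0 <= completed < len(method.steps):
        return method.steps[completed]
    return None
```

A recognizer could use expected_next_step to form an expectation about the next move in the dialogue, for example that after P1 asks a question P2 will next decide whether to comply.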



73. Note that the action suggested is not represented as a step in SUGGEST-AND- 
ACCEPT. This is because I see suggestions as directed toward acceptance of an idea, not 
directly toward the accomplishment of actions (hence the possible response, "That's a good 
idea"). There is, however, a close relation between a suggestion to do an action and the 
action actually being carried out. To represent this, we can say that there is a general social 
directive that people should behave reasonably: if there is good reason to carry out an 
action (and no overriding objection to it), then the action should be carried out. This 
information would be incorporated into processes for selecting goals. 

Appendix B. Matching Semantic Representations against Rules 

Although the ISA rules in this paper have been presented informally, I am assuming 
a highly restricted relationship between the patterns specified by the ISA rules and the 
semantic representations of particular English utterances. This appendix summarizes the 
relationship. 

First, we must consider the level of representation. The dialogue model I am 
assuming assigns more than one semantic representation to an utterance. (These 
representations are summarized in Appendix k) I wish, however, to make my remarks on 
matching as independent as possible of the particular choice of representation level for rules 
and utterances. For the purpose of explanation, it is assumed that the two representations 
are at the same level or close enough so that the correspondences between elements can be 
easily identified. 

This brings us to the constraints on matching. In saying that a semantic 
representation S matches a pattern in an ISA rule, I mean that the elements of S correspond 
to the elements of the pattern by any one of the following relationships: 

1. An element of S matches an element of the pattern exactly. 

2. An element of S has a subclass relationship to the element of the pattern. (I am 
assuming that the "class" relationship in "subclass" and "superclass" below is OWL-I 
specialization. See Hawkinson [13]. The important point here is that the two elements 
are related to each other by an explicit classification link in the knowledge base.) 

3. An element of S has a superclass relationship to the element of the pattern. 

4. An element of S may match a variable in the ISA pattern either by relationships 
(1)-(3) or by being an instance of the variable type. In this paper, variables are 
signalled by the use of some, a, or the in the statement of rules and preconditions. The 
indicates the case where a variable must be matched by an instance. A and some are 
used where either an instance match may occur or relationships (1)-(3) may be 
appealed to. 

5. Phrases in rules and speech act preconditions appearing in this paper in 
parentheses are not expected to have corresponding elements in utterance 
representations. 

6. One of the relationships (2), (3), or (4) applies, and added constraints on what may 
match the pattern are either given explicitly in the pattern or supplied by the other 
parts of the match that have been completed. 
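
Relationships (1) through (4) can be sketched as a small element matcher. This is an illustration under explicit assumptions, not the matching process of the dialogue model: the dictionaries PARENT and INSTANCE_OF are toy stand-ins for explicit classification links in a knowledge base, and the names Var, match_class, and match_element are hypothetical. Relationships (5) and (6) are not modelled.

```python
from dataclasses import dataclass

# Toy classification links (child -> parent), standing in for specialization links.
PARENT = {
    "thinking": "believe",
    "knowing": "believe",
    "assuming": "believe",
    "desiring": "want",
    "needing": "want",
}

# Toy instance links (instance -> type).
INSTANCE_OF = {"winning": "desirable result"}


@dataclass(frozen=True)
class Var:
    """A pattern variable; instance_only models a variable introduced by 'the'."""
    type: str
    instance_only: bool = False


def is_subclass(child, ancestor):
    """True if `child` lies strictly below `ancestor` in the toy hierarchy."""
    while child in PARENT:
        child = PARENT[child]
        if child == ancestor:
            return True
    return False


def match_class(s_elem, p_elem):
    """Relationships (1)-(3): exact, subclass, or superclass; None if unrelated."""
    if s_elem == p_elem:
        return "exact"
    if is_subclass(s_elem, p_elem):
        return "subclass"
    if is_subclass(p_elem, s_elem):
        return "superclass"
    return None


def match_element(s_elem, p_elem):
    """Relationship (4) for variables; otherwise fall back to (1)-(3)."""
    if isinstance(p_elem, Var):
        if INSTANCE_OF.get(s_elem) == p_elem.type:
            return "instance"
        if p_elem.instance_only:
            return None
        return match_class(s_elem, p_elem.type)
    return match_class(s_elem, p_elem)
```

Each call returns the name of the relationship that licenses the match, so a caller can treat a superclass match as the case where context has to do additional work.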



The third relationship above is necessary, but it is significantly less common than the first 
two. Since parts of the utterance are more general than the pattern, in these cases context 
plays a role in determining [...]. 

To illustrate the fourth relationship: 

B.1 Something good is bound to come of it if you buy a dollar ticket. 
B.2 You'll have a good chance of winning if you buy a dollar ticket. 

Examples B.1 and B.2 can be interpreted as REQUESTs derived from Rule 6.2, repeated 
here: 

P1 can convey a speech act indirectly by -- 

-- a representative with the propositional content that some desirable result or 
results can be expected for some intended effect of the speech act. 

Under this interpretation, the parts of the example in italics would correspond to the phrase 
"some desirable result or results" in the rule. The italicized clause in B.1 would be related to 
the rule by one of the relationships (1)-(3) above, depending on what stand one took on the 
relationship between the good and the desirable. The italicized clause in B.2 would be 
considered as conveying an instance of the phrase in the rule, since winning is one possible 
desirable result. 

In this account of matching, there is still a missing piece: the actual relationships 
between particular representations. In keeping with the informality of the rules that I am 
presenting, I do not attempt a taxonomy of representations (i.e., hierarchic structures from a 
world model). 74 In addition, I make no attempt to characterize the features of English that 
make some realisations of a rule more idiomatic than others. My purpose in discussing 
matching is to show that, despite the informality of the presentation, I am expecting the 
matching process to be a relatively straightforward, highly constrained operation. 



74. Two expressions that come up repeatedly in the statement of the rules, however, are 
worth noting. I assume that believe has a superclass relationship to all idea-holding 
concepts, e.g., thinking, knowing, assuming, hypothesizing, etc. Want has a superclass 
relationship to all goal-holding concepts, e.g., desiring, needing, etc. 

Appendix C. ISA Examples for Rules 6.1 to 6.9 

The rules restated: 

P1 can convey a speech act indirectly by -- 

Rule 6.1 

(i) ASKing whether the intended speech act is necessary 

(ii) ASKing whether an equivalent speech act (i.e., one with the same principal 
intended effect) has already been performed 

(iii) ASKing whether the principal intended effect can be expected to occur without the 
speech act 

(iv) ASKing whether the principal intended effect of the speech act has already 
occurred 

Rule 6.2 

-- a representative with the propositional content that some desirable result or results 
can be expected for some intended effect of the speech act 

Rule 6.3 (for P1-based preconditions): 

-- a representative of the topic of a P1-based precondition of the speech act. 

Rule 6.4 (for P2-based preconditions): 

-- an ASK of the topic of a P2-based precondition of the speech act. 

Rule 6.5 (for unmarked preconditions): 

-- an ASK of the topic of an unmarked precondition of the speech act. 
This rule applies in a context where P1 believes P2 has better knowledge of the 
condition in the precondition topic. 

Rule 6.6 (for unmarked preconditions): 

-- a representative of the topic of an unmarked precondition of the speech act. 
This rule applies in a context where P1 believes P1 has better knowledge of the 
condition in the precondition topic. 

Rule 6.7 (for unmarked preconditions): 

-- a representative of an unmarked precondition of the speech act where P1 believes he 
or she has better knowledge of the condition. 

Rule 6.8 (for groups of preconditions): 

-- a REQUEST form of an action that is a goal of P1 (i.e., A for "P1 wants A"). This 
rule is applicable only when the speech act has preconditions that are exact matches or 
specializations of the four preconditions of REQUEST. 

Rule 6.9 

-- an ASK about whether P2 will take responsibility for carrying out an "active" action 
that is a goal of P1 (i.e., A for "P1 wants A"). This rule is applicable only when the 
speech act has preconditions that are exact matches or specializations of the four 
preconditions of REQUEST. 

ASK 

propositional content: some question 

simple form: <interrogative>, e.g., Where's the mustard? 

principal intended effect: that P2 tell P1 the answer to the question in the propositional 
content 

Rule 6.1 

-i- Do I need to ask you where the mustard is? 
-ii- Did I ask you where the mustard is? 
-iii- Do you intend to tell me where the mustard is? 
-iv- Did you tell me where the mustard is? 

Rule 6.2 

-- I'll be able to finish these sandwiches if you tell me where the mustard is. 

Precondition-based Examples 

I. P1 wants to know the answer to the question. 

-R6.3- I want to know where the mustard is. 

II. P1 believes that P2 can tell the answer to the question. 

-R6.5- Can you tell me where the mustard is? 
-R6.6- You can tell me where the mustard is. 
-R6.7- I believe you can tell me where the mustard is. 

III. P1 believes that P2 is willing to tell the answer to the question. 

-R6.4- Would you be willing to tell me where the mustard is? 
-R6.4- Do you want to tell me where the mustard is? 

IV. P1 wants P2 to tell P1 the answer to the question. 

-R6.3- I'd like you to tell me where the mustard is. 

V. P1 believes that P2 has some obligation (a role obligation, authority obligation, or 
general obligation to be helpful) to P1 to tell P1 the answer to the question. 

-R6.5- GAP 75 
-R6.6- You ought to tell me where the mustard is. 
-R6.6- It's your obligation as a member of this household to tell me where the 
mustard is. 
-R6.7- I think you should tell me where the mustard is. 

II.-V. together: 

-R6.8- Tell me where the mustard is. 
-R6.9- Will you tell me where the mustard is? 



75. Although it is possible to construct contexts in which this form can be used, it seems to 
be only marginal. Forms such as "Shouldn't you tell me ...?" and "Don't you think you 
should tell me?" are far more common. (See Section 6.3.1 for discussion of these latter 
forms.) Other forms derived from application of Rule 6.3 to ASK V. do not convey an 
ASK of the propositional content at all; for example, [...] is a REQUEST 
that P2 refrain from telling. Perhaps Rule 6.5 does not apply to ASK V. because P1 is 
assumed to know about any obligations that P2 has to P1, so that there is no reason to 
question P2. 

OFFER 

propositional content: some action that P1 believes will be of benefit to P2 

simple form: none 

principal intended effect: that P2 accept P1's commitment to take responsibility for the action 
in the propositional content 

Rule 6.1 

-i- ?Do I need to offer you a ride to the airport? 76 
-ii- Has anybody offered you a ride to the airport? 
-iii- GAP 77 
-iv- Have you accepted a ride to the airport? 

Rule 6.2 

-- I'd feel a lot better if you'd accept a ride to the airport. 

Precondition-based Examples 

I. P1 wants to take responsibility for the action. 

-R6.3- I want to drive you to the airport. 

II. P1 believes that P1 can take responsibility for P1's part of the action. 

-R6.5- Can I drive you to the airport? 
-R6.6- I can drive you to the airport. 
-R6.7- I assume I can drive you to the airport. 

III. P1 is willing to take responsibility for P1's part of the action. 

-R6.3- I'm more than willing to drive you to the airport. 
-R6.3- I'd be glad to drive you to the airport. 

IV. P1 wants P2 to perform some action that complements P1's part of the action. 

-R6.3- I want you to accept a ride to the airport. 



76. This form seems to be marginal due to the conflict between its angry connotations and 
the level of politeness involved in an OFFER. 

77. This gap is explained by the fact that the principal intended effect for OFFER can be 
brought about only by the speech act; there is no independent means of achieving it. Thus, 
Clause iii of Rule 6.1 can never hold. 

V. P1 believes P2 can perform some action that complements P1's part of the action. 

-R6.5- Can you accept a ride to the airport? 
-R6.6- You can accept a ride to the airport. 
-R6.7- I assume you can accept a ride to the airport. 

VI. P1 believes that P2 would be willing to perform some action that complements 
P1's part of the action. 

-R6.4- Would you be willing to accept a ride to the airport? 

VII. P1 believes that P2 has an obligation [...] to perform some action 
that complements P1's part of the action. 

-R6.5- GAP 
-R6.6- You must accept a ride to the airport. 
-R6.7- GAP 

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest) 
to perform some action that complements P1's part of the action. 

-R6.5- GAP 78 
-R6.6- That suitcase is heavy. You should let me drive you to the airport. 
-R6.7- That suitcase is heavy. I think you should let me drive you to the airport. 

IV.-VII. together: 

-R6.8- Please accept a ride to the airport. 
-R6.9- Will you accept a ride to the airport? 



78. See discussion for ASK V. 

SUGGEST 

propositional content: an action 

except for: actions in which P1 and P2 share common agency 

simple forms: 

1. What about <action>? e.g., What about joining the Marines? 
2. How about <action>? e.g., How about joining the Marines? 

principal intended effect: that P2 consider taking responsibility for the action in the 
propositional content. 

Rule 6.1 

-i- Need I suggest that you join the Marines? 
-ii- Has anyone suggested that you join the Marines? 
-iii- Are you thinking about joining the Marines? 
-iv- Have you considered joining the Marines? 

Rule 6.2 

-- I'd be pleased if you'd consider joining the Marines. 

Precondition-based Examples 

I. P1 wants P2 to consider taking responsibility for the action. 

-R6.3- I want you to think about joining the Marines. 

II. P1 believes that P2 can consider taking responsibility for the action. 

-R6.5- Could you think about joining the Marines? 
-R6.6- You could think about joining the Marines. 
-R6.7- I think you could consider joining the Marines. 

III. P1 believes that P2 is willing to consider taking responsibility for the action. 

-R6.4- Are you willing to consider joining the Marines? 
-R6.4- Do you want to think about joining the Marines? 

IV. P1 believes that P2 has an obligation (to "be helpful") to P1 to consider the 
action. 

-R6.5- GAP 79 
-R6.6- You should think about joining the Marines. 



79. See discussion for ASK V. 

-R6.7- I think you should consider joining the Marines. 

V. P1 believes that P2 can take responsibility for the action. 

-R6.5- Can you join the Marines? 
-R6.6- You can join the Marines. 
-R6.7- I assume you can join the Marines. 

VI. P1 believes that P2 is willing to take responsibility for the action. 

-R6.4- Are you willing to join the Marines? 
-R6.4- Would you be willing to join the Marines? 

VII. P1 believes that there are some reasons why the action is desirable. 

-R6.5- Would it be good for you to join the Marines? 
-R6.6- It would be good for you to join the Marines. 
-R6.6- You'd be a credit to your [...] 
-R6.7- I believe it would be good for you to join the Marines. 

VIII. P1 believes that P2 has an obligation to P2 (by virtue of P2's own self-interest) 
to consider taking responsibility for the action. 

-R6.5- GAP 80 
-R6.6- You need a new experience. You should join the Marines. 
-R6.7- You need a new experience. I think you should join the Marines. 

I.-IV. together: 

-R6.8- Think about joining the Marines. 
-R6.9- Will you consider joining the Marines? 


80. See discussion for ASK V. 

Appendix D. Relating Rule 7.6 to OWL-I Methods 

The notion of a basis for action defined in Section 7.5.3 can be related to the general 
structure of actions discussed in Section 4. By doing so, we get an additional level of detail 
for the definition of "basis for action" and hence a sounder understanding of Rule 7.6 and 
of type 3 implicit-action forms. Moreover, it should be easy to see that the additional level
of detail has strong relevance for a computational treatment of these forms. I start with a 
look at undesirable states and then go on to look at signals and symptoms. 

Undesirable states that refer to ongoing methods (i.e., actions in progress) correspond
to method failure conditions. For example, D.1, which reports a failure in an ongoing
copying process, may be used to REQUEST that P2 fix the copier.

D.1 The paper jammed.

Other undesirable states (e.g., "It's cold in here") will match the negation of the
PRINCIPAL-RESULT or other results of some method known to P2. (We expect the
method(s) to be known to P2 due to P1's obligation to avoid ambiguity; see Section 6.1.) In
addition, I suspect that very frequently undesirable states will correspond to an initial
condition 81 in a special planning method. For example, there might be a special planning
method that contained information on "what to do when the room is cold," including
alternative actions and the tradeoffs between them. (For further discussion of these
methods, see Section 12.)



81. This could possibly be represented in a semantic input case called INITIAL-CONDITION,
which would be defined as a specialization of SOURCE, i.e., the point started from.

This takes care of the relationship of undesirable states to OWL-I methods. The
handling of signals and symptoms differs according to whether or not they are related to
methods already underway in the dialogue encounter. Signals and symptoms related to
ongoing methods will appear in a conditional step. For example, a cake-baking method
that uses an oven timer will have a conditional step corresponding to the following:

When the buzzer of the oven timer goes off, check to see if the cake is done. 

That is, given the signal, perform an action. Similarly, whether or not a cake method uses
a signal, the following sort of conditional step might appear:

When the cake starts to smell, check to see if it is done. 

Again the same type of pattern: given a symptom, perform an action. In addition to the
simple presence of the states and actions in the method, we might want to label them
explicitly as symptoms or signals. Note that the conditional steps referred to in type 3
implicit-actions may appear not only in the most general ongoing methods but also in
substeps of them (which are themselves, of course, ongoing methods). Thus, not only the
steps in the bake-cake method might be used as a basis for type 3 ISAs but also steps in the
mix-batter or grease-pan methods.
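
The search over ongoing methods and their substeps described above can be sketched roughly as follows. This is only an illustration in Python, not OWL-I notation: the class names `Method` and `ConditionalStep`, the `kind` label, and the "batter is lumpy" step are assumptions introduced for the example.

```python
from dataclasses import dataclass, field
from typing import List, Tuple, Union


@dataclass
class ConditionalStep:
    trigger: str  # the state that serves as the signal or symptom
    action: str   # the action to perform when the trigger holds
    kind: str     # explicit label, "signal" or "symptom" (hypothetical field)


@dataclass
class Method:
    name: str
    # A step is either a substep (itself a method) or a conditional step.
    steps: List[Union["Method", ConditionalStep]] = field(default_factory=list)


def find_conditional_steps(method: Method, reported_state: str) -> List[Tuple[str, ConditionalStep]]:
    """Collect (method name, step) pairs whose trigger equals the reported state.

    Substeps are searched recursively, since they are ongoing methods too.
    """
    found = []
    for step in method.steps:
        if isinstance(step, Method):
            found.extend(find_conditional_steps(step, reported_state))
        elif step.trigger == reported_state:
            found.append((method.name, step))
    return found


# Hypothetical encoding of the cake example; the mix-batter step is invented.
bake_cake = Method("bake-cake", [
    Method("grease-pan"),
    Method("mix-batter", [
        ConditionalStep("the batter is lumpy", "keep stirring", "symptom"),
    ]),
    ConditionalStep("the oven timer buzzer goes off",
                    "check to see if the cake is done", "signal"),
    ConditionalStep("the cake starts to smell",
                    "check to see if the cake is done", "symptom"),
])

hits = find_conditional_steps(bake_cake, "the cake starts to smell")
```

A reported state that matches no trigger anywhere in the method tree yields an empty list, which would be the cue to try the other bases for action.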

For signals and symptoms unrelated to ongoing methods, it appears that one needs an
additional set of representations in the knowledge base. These would contain information
such as that for the following example signal:

The fire alarm is a signal of a fire or a drill.
When it goes off, leave the building immediately.
i.e., <signal> is a signal of <event or process>.
When <signal> happens, <carry out an action>.



A comparable example for symptoms is: 

P1's shivering is a symptom of P1's being cold,
i.e., <symptom> is a symptom of <state>.



Note that symptoms, especially, are not necessarily one-to-one. That is, shivering may also
be a symptom of other states, e.g., fright. Second, I am expecting the fact that it is
undesirable for a person to be cold to be independently determinable from other
information in the knowledge base.
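
One way to picture these independent structures is a small table of signals and symptoms, sketched below in Python. The class `IndicatorKB`, its method names, and the `UNDESIRABLE` set are assumptions made for illustration; in particular, the set of undesirable states is passed in separately to reflect the expectation that undesirability is determined elsewhere in the knowledge base.

```python
from collections import defaultdict


class IndicatorKB:
    """Toy store for signals and symptoms not tied to an ongoing method."""

    def __init__(self):
        self.signals = {}                 # signal -> (events or processes, action)
        self.symptoms = defaultdict(set)  # symptom -> possible states (not one-to-one)

    def add_signal(self, signal, indicates, action):
        # <signal> is a signal of <event or process>; when it happens, do <action>.
        self.signals[signal] = (set(indicates), action)

    def add_symptom(self, symptom, state):
        # <symptom> is a symptom of <state>; one symptom may have several states.
        self.symptoms[symptom].add(state)

    def action_for(self, signal):
        entry = self.signals.get(signal)
        return entry[1] if entry else None

    def candidate_states(self, symptom):
        return set(self.symptoms.get(symptom, ()))


def undesirable_readings(kb, symptom, undesirable):
    """Keep only the candidate states that are independently known to be undesirable."""
    return kb.candidate_states(symptom) & undesirable


kb = IndicatorKB()
kb.add_signal("the fire alarm goes off", ["fire", "drill"],
              "leave the building immediately")
kb.add_symptom("P1 is shivering", "P1 is cold")
kb.add_symptom("P1 is shivering", "P1 is frightened")

# Assumption: this set stands in for information held elsewhere in the knowledge base.
UNDESIRABLE = {"P1 is cold"}
readings = undesirable_readings(kb, "P1 is shivering", UNDESIRABLE)
```

Keeping the symptom-to-state mapping as a set makes the many-to-one character of symptoms explicit: the recognizer receives every candidate state and must choose among them using other knowledge.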


In summary, to relate type 3 implicit-action forms to OWL-I methods, we have called
on a number of different structures. Failure conditions, PRINCIPAL-RESULTs and other
results, and states in conditional steps were used to represent the undesirable states, signals, 
and symptoms from Rule 7.6. In addition, independent structures were suggested for some 
uses of signals and symptoms. 



Appendix E: A Summary of Representations Used in the General Recognition Model

The system configuration discussed in Section 9.1 presupposes various types of 
semantic representations. In this appendix I collect the different representations and show 
briefly how they relate to each other. This list is suggested as the minimal set of 
representations for a general recognition process. 

Surface Semantic Representation (SSR)

This representation, output by the parser, closely follows the surface English utterance. Its
salient features are:

- Noun group references not needed for the parsing process may remain unresolved
(e.g., "I saw him").

- Systematically ambiguous relationships between constituents may be unresolved if
not needed for parsing. For example, "He was hit by the door." Here by is
systematically ambiguous between the agent or cause of the action in a passive
construction (i.e., the door made contact with the person) and a locative (i.e.,
the door is viewed as a place). The fact that the locative is less likely as an
interpretation does not mean that it can be ignored as a possible reading.

- Choices between word senses need not be made unless, again, they are forced by the
parsing process. For example, in "I get it" get may be synonymous with understand
or, if it refers to a journal, get could be synonymous with receive or even subscribe to.
The SSR, however, would contain only a concept corresponding to get unless further
specialization were necessary for a completed parse.

- ISA forms are preserved. That is, "Can you open the window?" would always have 
a SSR that recorded its interrogative nature and that contained a concept 
corresponding to can. Representation of any REQUEST intentions would not occur 
at this level. 
Interpreter Level Representation (ILR)

This is the main level of representation used by the Interpreter; methods are among
the structures represented at this level. ILR has the following characteristics:

- It is a case-frame oriented representation. The core set of cases are described in 
Section 4. 

- Among its constituent concepts ILR may contain variables. The use of these is 
discussed for the next representation type. 

Evaluated ILR

This is an ILR with the appropriate other pieces of ILR bound to its variables.
Evaluation allows general pieces of ILR to be related to current environments. Evaluated
ILR is output by the module Evaluate and is used, among other places, in the structural
expectations (see Section 9.2).

For example, if an order-fast-food procedure step were evaluated for three Big
Burgers and a person called Julia, then the concept for Julia would be bound to the
variable for the AGENT of the call, and the concept for the Big Burgers would be bound
to the variable in the OBJECT specification. We thus get a transformation from one
representation to another with roughly the following information content:

R1: AGENT (a person) orders fast food from the CO-AGENT (a person)
R2: Julia orders three Big Burgers from Counter-person-1

The evaluated call representation of R2 can be used to search for an appropriate method to 
carry out the procedure.
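
The step from R1 to R2 can be pictured as filling variables in a case frame. The Python sketch below is only an assumed encoding: the `Var` class, the dictionary layout, the ACTION entry, and the function name `evaluate` (borrowed loosely from the module name Evaluate) are illustrative and are not the model's actual data structures.

```python
from dataclasses import dataclass
from typing import Dict, Union


@dataclass(frozen=True)
class Var:
    """An unbound ILR variable (hypothetical encoding)."""
    name: str


Filler = Union[Var, str]


def evaluate(frame: Dict[str, Filler], bindings: Dict[str, str]) -> Dict[str, Filler]:
    """Return a copy of the case frame with bound variables replaced by concepts.

    Variables that have no binding in the current environment are left in place.
    """
    return {
        case: bindings.get(filler.name, filler) if isinstance(filler, Var) else filler
        for case, filler in frame.items()
    }


# R1: the general call, with variables in its case slots.
R1 = {
    "ACTION": "order-fast-food",
    "AGENT": Var("agent"),
    "OBJECT": Var("food"),
    "CO-AGENT": Var("co-agent"),
}

# R2: the same call evaluated in an environment containing Julia and her order.
R2 = evaluate(R1, {
    "agent": "Julia",
    "food": "three Big Burgers",
    "co-agent": "Counter-person-1",
})
```

Because `evaluate` returns a new frame, the general representation R1 stays available for evaluation in other environments.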
Transitional Representation

Transitional representation is the level of representation that appears in special
matching patterns, some of which are discussed in Section 11. 82 This representation is
called "transitional" because it bridges the gap between SSR and ILR. Transitional
representations share case frame structuring and the use of variables with ILR, but they
share sentence-type marking (declarative, interrogative, imperative) and indirect forms with
SSR. Each transitional representation has an associated ILR, and a successful match of a
SSR against the transitional representation (e.g., by the reference matching process)
automatically binds the variables in the ILR.
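
A rough sketch of such a match, under stated assumptions, is given below in Python. The pattern layout, the "?" prefix for variables, the MODAL case, and the function names `match` and `instantiate` are all invented for the illustration; the example pattern is modeled on the "Can you open the window?" form mentioned for SSR.

```python
from typing import Dict, Optional


def is_var(term: str) -> bool:
    """Variables are written with a leading '?' in this sketch."""
    return term.startswith("?")


def match(pattern: dict, ssr: dict) -> Optional[Dict[str, str]]:
    """Match an SSR against a transitional pattern; return bindings or None."""
    # Sentence-type marking is shared with SSR, so it must agree first.
    if pattern["sentence_type"] != ssr["sentence_type"]:
        return None
    bindings: Dict[str, str] = {}
    for case, expected in pattern["cases"].items():
        actual = ssr["cases"].get(case)
        if actual is None:
            return None
        if is_var(expected):
            # A variable must be bound consistently wherever it occurs.
            if bindings.setdefault(expected, actual) != actual:
                return None
        elif expected != actual:
            return None
    return bindings


def instantiate(ilr: Dict[str, str], bindings: Dict[str, str]) -> Dict[str, str]:
    """Bind the variables of the associated ILR using the match result."""
    return {case: bindings.get(filler, filler) for case, filler in ilr.items()}


# Hypothetical transitional pattern for the "Can you <action> <object>?" form.
CAN_YOU = {
    "sentence_type": "interrogative",
    "cases": {"MODAL": "can", "AGENT": "hearer",
              "ACTION": "?action", "OBJECT": "?object"},
    "ilr": {"SPEECH-ACT": "REQUEST", "AGENT": "hearer",
            "ACTION": "?action", "OBJECT": "?object"},
}

# Hypothetical SSR for "Can you open the window?"
ssr = {
    "sentence_type": "interrogative",
    "cases": {"MODAL": "can", "AGENT": "hearer",
              "ACTION": "open", "OBJECT": "the window"},
}

bindings = match(CAN_YOU, ssr)
request = instantiate(CAN_YOU["ilr"], bindings) if bindings is not None else None
```

The REQUEST reading appears only in the associated ILR, which keeps the SSR itself free of any representation of REQUEST intentions.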

Other Representations 

It is quite possible that some semantic domains will require other sorts of 
representations. While the units of the current ILR are not to be construed as lexical items,
ILR is intentionally linguistically oriented. There would no doubt be a need for quite
different representations, e.g., tabular information and mathematical formulas. The
representations summarized in this section, then, are seen as a minimum, not necessarily a 
complete set. 



82. This representation type was called "subsurface level representation" in earlier versions 
of the model. 



